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[3£^!<7)£i#] Video Text Processing Apparatus 

1 . A text change frame detection apparatus that selects 
a plurality of video frames including text contents from 
given video frames, characterized in that said apparatus 
comprises : 

first frame removing means for removing redundant 
video frames from the given video frames; 

second frame removing means for removing video 
frames that do not contain a text area from the given 
video frames; 

third frame removing means for detecting and 
removing redundant video frames caused by image shifting 
from the given video frames; and 

output means for outputting remaining video frames 
as candidate text change frames. 

2. A text extraction apparatus that extracts at least 
one text line region from a given image, characterized in 
that said apparatus comprises: 

edge image generation means for generating edge 
information of the given image; 

stroke image generation means for generating a 
binary image of candidate character strokes in the given 
image by using the edge information; 

stroke filtering means for removing a false stroke 
from the binary image by using the edge information; 

text line region formation means for combining a 
plurality of strokes into a text line region; 
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text line verification means for removing a false 
character stroke from the text line region and reforming 
the text line region; 

text line binarization means for binarizing the text 
line region by using a height of the text line region; and 

output means for outputting a binary image of the 
text line region . 

3 . A program for a computer that selects a plurality of 
video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

removing redundant video frames from the given video 
frames; 

removing video frames that do not contain a text 
area from the given video frames; 

detecting and removing redundant video frames caused 
by image shifting from the given video frames; and 

outputting remaining video frames as candidate text 
change frames . 

4 . A program for a computer that selects a plurality of 
video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

determining whether two image blocks in the same 
position in two video frames of given video frames are a 
valid block pair that has an ability to show a change of 
image contents; 

calculating a similarity of two image blocks of the 
valid block pair and determining whether the two image 
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blocks are similar; 

determining whether the two video frames are similar 
by using a ratio of a number of similar image blocks to a 
total number of valid block pairs; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames. 

5. A program for a computer that selects a plurality of 
video frames including text contents from given video 
frames , characterized in that the program directs the 
computer to perform a process comprising: 

generating a first binary image of a video frame of 
the given video frames; 

determining a position of a text line region by 
using a horizontal projection and a vertical projection of 
the first binary image; 

generating a second binary image of every text line 
region; 

determining validity of a text line region by using 
a difference between the first binary image and the second 
binary image and a fill rate of a number of foreground 
pixels in the text line region to a total number of pixels 
in the text line region; 

confirming whether a set of continuous video frames 
are non-text frames that do not contain a text area by 
using a number of valid text line regions in the set of 
continuous video frames; and 

outputting remaining video frames after the non-text 
frames are removed, as candidate text change frames. 

6. A program for a computer that selects a plurality of 
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video frames including ' text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

generating binary images of two video frames of the 
given video frames; 

determining a vertical position of every text line 
region by using horizontal projections of the binary 
images of the two video frames; 

determining a vertical offset, of image shifting 
between the two video frames and a similarity of the two 
video frames in a vertical direction by using correlation 
between the horizontal projections; 

determining a horizontal offset of the image 
shifting and a similarity of the two video frames in a 
horizontal direction by using correlation between vertical 
projections of every text line in the binary images of the 
two video frames; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames. 
7. A program for a computer that extracts at least one 
text line region from a given image, characterized in that 
the program directs the computer to perform a process 
comprising: 

generating edge information of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge information; 

removing a false stroke from the binary image by 
using the edge information; 

combining a plurality of strokes into a text line 
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region; 

removing a false character stroke from the text line 
region and reforming the text line region; 

binarizing the text line region by using a height of 
the text line region; and 

outputting a binary image of the text line region. 
8. A program for a computer that extracts at least one 
text line region from a given image, characterized in that 
the program directs the computer to perform a process 
comprising : 

generating an edge image of the given image; 
generating a binary image of candidate character 
strokes in the given image by using the edge image; 

checking an overlap rate of a contour of a stroke in 
the binary image of the candidate character strokes by 
pixels indicating an edge in the edge images- 
determining that the stroke is a valid stroke if the 
overlap rate is greater than a predefined threshold and an 
invalid stroke if the overlap rate is less than the 
predefined thresholds- 
removing the invalid stroke; and 

outputting information of remaining strokes in the 
binary image of the candidate character strokes. 

Technical Field to which Invention Belongs 

The present invention relates to a video image 
processing apparatus, more specifically to a text image 
extraction apparatus for e-Learning video. The text change 
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frame detection apparatus locates the video frames that 
contain text information. The text extraction apparatus 
extracts the text information out of the video frames and 
send the extracted text information to an optical 
character recognition (OCR) engine for recognition. 

Prior Art 

Text .retrieval in video and image is a very 
important technique and has a variety of application,, such 
as storage capacity reduction, video and image indexing, 
and digital library, etc. 

The present invention focuses on a special type of 
video - e-Learning video, which often contains a large 
amount of text information. In order to efficiently 
retrieve the text content in the video, two techniques are 
needed: text change frame detection in video and text 
extraction from image. A text change frame is a frame that 
marks the change of text content in a video. The first 
technique fast browses the video and selects those video 
frames that contain text area. The second technique then 
extracts the text information from those video frames and 
sends them to an OCR engine for recognition. 

Text change frame detection technique can be 
regarded as a special case of scene change frame detection 
technique. The techniques for detecting the scene change 
frame that marks the changes of the content in video from 
a plurality of frames in a video have been studied 
actively in recent years. Some methods focus on the 
intensity difference between frames, some methods focus on 
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the difference of color histogram and the texture. However, 
these methods are not suitable for text change frame 
detection in video, especially in e-Learning field. 

Take presentation video - a typical e-Learning video 
as example, in which the video frame often contains a 
slide image. Examples of slide image include the 
PowerPoint® image and the film image from a projector. 
...The change of the content of slide will not cause a 
dramatic change in color and texture. Also, the focus of 
the video camera often moves around in a slide image 
during the talk, which causes image shifting. Image 
shifting also occurs when the speaker moving his or her 
slides. These content shifting frames will be marked as 
scene change frames by conventional methods. Another 
drawback of the conventional method is that they can not 
tell directly whether a frame contains text information. 

Another way to extract text change frame from video 
is performing text extraction method on every frame in the 
video and judging whether the content has been changed. 
The problem of such strategy is that it is very time 
consuming. 

After the text change frames are detected, a text 
extraction method should be used to extract the text lines 
from the frames. Many methods are proposed to extract the 
text lines from video and static image (see non-patent 
documents 1 and 2, for example) . 

Also, some patents related to this field have been 
published (see patent documents 1, 2 and 3, for example) . 

These methods will meet problems when deal with 
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video frame in e-Learning . The characters in e-Learning 
video image always have very small size f also the 
boundaries of these characters are very dim, and there are 
many disturbances around the text area, such like the 
bounding box of text line, the shading and occlusion of 
human body, etc. 

Non-Patent Document: 1 _.__ = . _ _ .„-__.. 

V. Wu, R. Manmatha, and E. M. Riseman, "TextFinder: An 
Automatic System to Detect and Recognize Text in Images," 
IEEE transactions on Pattern Analysis and Machine 
Intelligence, VOL. 21, NO. 11, pp. 1224-1229, November, 
1999. 

Non- Patent Document 2 

T. Sato, T. Kanade, E. Hughes, M. Smith, and S. Satoh, 
"Video OCR: Indexing Digital News Libraries by Recognition 
of Superimposed Captions," ACM Multimedia Systems Special 
Issue on Video Libraries, February, 1998. 

Patent Document 1 

U.S. Patent No. 6,366,699, specification. 
Patent Document 2 

U.S. Patent No. 5,465,304, specification. 
Patent Document 3 

U.S. Patent No. 5,307,422, specification. 
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Problems to be Solved by Invention 

However , there are the following problems in the 
above mentioned conventional video image processing. 

It is very time consuming to perform text extraction 
method on every frame in the video and judge whether the 
content has been changed. 

The characters in e-Learning video image always have 
very small size, also the boundaries of these characters., 
are very dim, and there are many disturbances around the 
text area. Therefore, the conventional text extraction 
method will leave many false character strokes in the 
final binary image, which give a wrong recognition result 
in the following OCR stage. 

It is an object of the present invention to select 
the candidate text change frames from a plurality of video 
frames in a fast speed, while keeping a high recall rate, 
which is defined as the rate of the number of extracted 
correct text change frames to the total number of correct 
text change f rames . 

It is another object of the present invention to 
provide a scheme for efficiently detecting the text region 
in the text change frame, removing as much as possible the 
false character strokes, and providing a binarized image 
for every text line. 

Means for Solving Problems 

The above objects are fulfilled by a video text 
processing apparatus for fast selecting from ail frames in 
a video those frames that contain text contents, marking 
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the region of each text line in the text frame and 
outputting the text line in a binary form, comprising a 
text change frame detection apparatus for fast selecting 
text frames in the video and a text extraction apparatus 
for extracting the text lines in the text frame. The 
binary form is, for example, represented by black pixels 
corresponding to background and white pixels corresponding 
to character strokes. 

The first text change frame detection apparatus 
comprises first frame removing means, second frame 
removing means, third frame removing means and output 
means, and selects a plurality of video frames including 
text contents from given video frames. The first frame 
removing means removes redundant video frames from the 
given video frames. The second frame removing means 
removes video frames that do not contain a text area from 
the given video frames. The third frame removing means 
detects and removes redundant video frames caused by image 
shifting from the given video frames. The output means 
outputs remaining video frames as candidate text change 
frames . 

The second text change frame detection apparatus 
comprises image block validation means, image block 
similarity measurement means, frame similarity judgment 
means and output means, and selects a plurality of video 
frames including text contents from given video frames. 

The image block validation means determines whether 
two image blocks in the same position in two video frames 
of given video frames are a valid block pair that has an 
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ability to show a change of image contents. The image 
block similarity measurement means calculates a similarity 
of two image blocks of the valid block pair and determines 
whether the two image blocks are similar. The frame 
similarity judgment means determines whether the two video 
frames are similar by using a ratio of the number of 
similar image blocks to the total number of valid block 
pairs. The output means outputs remaining video frames 
after a similar video frame is removed, as candidate text 
change frames. 

The third text change frame detection apparatus 
comprises fast and simple image binarization means, text 
line region determination means, rebinarization means, 
text line confirmation means, text frame verification 
means and output means, and selects a plurality of video 
frames including text contents from given video frames. 
The fast and simple image binarization means generates a 
first binary image of a video frame of the given video 
frames. The text line region determination means 
determines a position of a text line region by using a 
horizontal projection and a vertical projection of the 
first binary image. The rebinarization means generates a 
second binary image of every text line region. The text 
line confirmation means determines validity of a text line 
region by using a difference between the first binary 
image and the second binary image and a fill rate of the 
number of foreground pixels in the text line region to the 
total number of pixels in the text line region. The text 
frame verification means confirms whether a set of 
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continuous video frames are non-text frames that do not 
contain a text area by using the number of valid text line 
regions in the set of continuous video frames. The output 
means outputs remaining video frames after the non-text 
frames are removed, as candidate text change frames. 

The fourth text change frame detection apparatus 
comprises fast and simple image binarization means, text 
line vertical position determination means , .vertical 
shifting detection means, horizontal shifting detection 
means and output means, and selects a plurality of video 
frames including text contents from given video frames. 
The fast and simple image binarization means generates 
binary images of two video frames of the given video 
frames. The text line vertical position determination 
means determines a vertical position of every text line 
region by using horizontal projections of the binary 
images of the two video frames. The vertical shifting 
detection means determines a vertical offset of image 
shifting between the two video frames and a similarity of 
the two video frames in a vertical direction by using 
correlation between the horizontal projections. The 
horizontal shifting detection means determines a 
horizontal offset of the image shifting and a similarity 
of the two video frames in a horizontal direction by using 
correlation between vertical projections of every text 
line in the binary images of the two video frames. The 
output means outputs remaining video frames after a 
similar video frame is removed, as candidate text change 
frames . 
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After the candidate text change frames in the video 
are detected by the text change frame detection apparatus, 
the image of every frame is then sent to the text 
extraction apparatus for text extraction. 

The first text extraction apparatus comprises edge 
image generation means, stroke image generation means, 
stroke filtering means, text line region formation means, 
text line verification means A text line binarization means 
and output means, and extracts at least one text line 
region from a given image. The edge image generation means 
generates edge information of the given image. The stroke 
image generation means generates a binary image of 
candidate character strokes in the given image by using 
the edge information. The stroke filtering means removes 
the false strokes from the binary image by using the edge 
information. The text line region formation means combines 
a plurality of strokes into a text line region. The text 
line verification means removes a false character stroke 
from the text line region and reforms the text line region. 
The text line binarization means binarizes the text line 
region by using a height of the text line region. The 
output means outputs a binary image of the text line 
region . 

The second text extraction apparatus comprises edge 
image generation means, stroke image generation means, 
stroke filtering means and output means, and extracts at 
least one text line region from a given image. The edge 
image generation means generates an edge image of the 
given image. The stroke image generation means generating 
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a binary image of candidate character strokes in the given 
image by using the edge image. The stroke filtering means 
checks an overlap rate of a contour of a stroke in the 
binary image of the candidate character strokes by pixels 
indicating an edge in the edge image, determines that the 
stroke is a valid stroke if the overlap rate is greater 
than a predefined threshold and an invalid stroke if the 
overlap rate is less than the predefined threshold, and 
removes the invalid stroke. The output means outputs 
information of remaining strokes in the binary image of 
the candidate character strokes - 

After the text line regions are extracted by the 
text extraction apparatus, they are sent to an OCR engine 
for recognition. 

Embodiments of Invention 

The embodiments of the present invention are 
described below in detail by referring to the drawings. 

Fig. 1 shows the configuration of the video text 
processing apparatus according to the present invention. 
The input of the apparatus is an existing video data 101 
or living video stream from a television (TV) video camera 
102, the input video data is first decomposed into 
continuous frames by a video decomposition unit 103. Then 
a text change frame detection apparatus 104 is used to 
find the candidate text change frames in the video frames. 
The text change frame detection apparatus will greatly 
reduce the total processing time. After that, a text 
extraction apparatus 105 is enforced on every candidate 
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text change frame to detect text lines (text areas) in the 
frames and output the images of the text lines to a 
database 106 for further OCR processing. 

Fig. 2 shows the processing flow chart of the video 
text processing apparatus shown in Fig. 1. A process in 
S201 is performed by the video decomposition unit 103, 
processes in S202 to S204 are performed by the text change 
, _ frame detection apparatus 1.04, and processes in S205 to 
S210 are performed by the text extraction apparatus 105. 

First the input video is decomposed into continuous 
frames (5201) . Then frame similarity measurement is 
performed to measure the similarity of two nearby frames 
(S202) . If the two frames are similar, then the second 
frame is removed. Next text frame detection and 
verification is performed to judge whether the remaining 
frames from the process in S202 contain text lines (S203) . 
If a frame does not contain a text line, the frame is 
removed. Image shifting detection is further performed to 
determine whether image shifting exists in two frames 
(S204) . If so, the second frame is removed. The output of 
the text change frame detection apparatus 104 is a group 
of candidate text change frames. 

For every candidate text change frame, edge image 
generation is performed to generate the edge image of the 
frame (S205) . Then stroke generation is performed to 
generate the stroke image based on edge information (S206) . 
Next stroke filtering is performed to remove false strokes 
based on edge information (S207). Text line region 
formation is further performed to connect individual 
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strokes into a text line (S208) . After that, text line 
verification is performed to remove false strokes in a 
text line and re-form the text line (S209) . Finally, text 
line binarization is performed to produce the final binary 
image of the text line (S210) . The final output is a 
serial of binary text line images that will be processed 
by an OCR engine for recognition. 

. Fig . 3 shows the configuration of the text change 
frame detection apparatus 104 shown in Fig. 1. The input 
video frames are first sent to a frame similarity 
measurement unit 301 for deleting duplicate frames, then a 
text frame detection and verification unit 302 is used to 
check whether a frame contains text information. Next, an 
image shifting detection unit 303 is used to remove 
redundant frames that caused by image shifting. The frame 
similarity measurement unit 301, the text frame detection 
and verification unit 302 and the image shifting detection 
unit 303 correspond to the first, second and third frame 
removing means, respectively- The text change frame 
detection apparatus 104 is very suitable to detect text 
change frame in e-Learning video. It can remove duplicate 
video frames, shifting video frames as well as video 
frames that do net contain text area in a very fast speed 
while keeping a high recall rate. 

Fig. 4 shows the configuration of the frame 
similarity measurement unit 301 shown in Fig. 3. The frame 
similarity measurement unit 301 includes an image block 
validation unit 311, an image block similarity measurement 
unit 312, and a frame similarity judgment unit 313. The 
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image block validation unit 311 determines whether two 
image blocks in the same position in two video frames are 
a valid block pair. A valid block pair is an image block 
pair that has the ability to show the change of the image 
content. The image block similarity measurement unit 
312 calculates the similarity of two image blocks of the 
valid block pair and determines whether the two image 
. blocks are similar... The .frame similarity judgment unit 313_ 
determines whether the two video frames are similar by 
using a ratio of the number of similar image blocks to the 
total number of valid block pairs. According to the frame 
similarity measurement unit 301 , duplicate frames are 
efficiently detected and removed from the video frames. 

Fig. 5 shows the configuration of the text frame 
detection and verification unit 302 shown in Fig. 3. The 
text frame detection and verification unit 302 includes a 
fast and simple image binarization unit 321, a text line 
region determination unit 322, a rebinarization unit 323, 
text line confirmation unit 324, and text frame 
verification unit 325. The fast and simple image 
binarization unit 321 generates the first binary image of 
a video frame. The text line region determination unit 322 
determines the position of a text line region by using a 
horizontal projection and a vertical projection of the 
first binary image. The rebinarization unit 323 generates 
the second binary image of every text line region. The 
text line confirmation unit 324 determines the validity of 
a text line region by using the difference between the 
first binary image and the second binary image and a fill 
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rate of the number of foreground pixels in the text line 
region to the total number of pixels in the text line 
region. The text frame verification unit 325 confirms 
whether a set of continuous video frames are non-text 
frames that do not contain a text area by using the number 
of valid text line regions in the set of continuous video 
frames. According to the text frame detection and 
verification unit 302, non-text, frames are fast detected 
and removed from the video frames. 

Fig. 6 shows the configuration of the image shifting 
detection unit 303 shown in Fig. 3. The image shifting 
detection unit 303 includes a fast and simple image 
binarization unit 331, a text line vertical position 
determination unit 332, and a vertical shifting detection 
unit 333, a horizontal shifting detection unit 334. The 
fast and simple image binarization unit 331 generates 
binary images of two video frames. The text line vertical 
position determination unit 332 determines the vertical 
position of every text line region by using horizontal 
projections of the binary images- The vertical shifting 
detection unit 333 determines a vertical offset of image 
shifting between the two video frames and the similarity 
of the two video frames in the vertical direction by using 
the correlation between the horizontal projections. The 
horizontal shifting detection unit 334 determines a 
horizontal offset of the image shifting and the similarity 
of the two video frames in the horizontal direction by 
using the correlation between vertical projections of 
every text line in the binary images. According to the 
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image shifting detection unit 303, redundant frames caused 
by image shifting are fast detected and removed from the 
video frames. 

Figs. 7 and 8 show two frames that have same text 
content, rig. 9 shows the processing result of the frame 
similarity measurement unit 301 for these two frames. The 
white boxes in the fig. 9 mark out all valid image blocks 
-™- which are .-blocks- included by the valid block pairs and.=._. 
have the ability to show the change of the content. Boxes 
with solid line stand for similar image blocks and boxes 
with dashed line stand for dissimilar image blocks. Since 
the ratio of the number of similar image blocks tc the 
number of valid blocks is larger than a predefined 
threshold, these two images are considered as similar and 
the second frame is removed. 

Fig. 10 shows the flowchart of the operation of the 
frame similarity measurement unit 301 shown in Fig. 4. The 
comparison starts at 0 th frame of 0 th second (S5G1), the 
current i th frame is compared with the j th frame, which 
has a frame interval of STEP frames (S502) . If the i th 
frame is similar with the j th frame in comparing the two 
frames (S503) , then the current frame jumps to j th frame 
(S510) and the processes in S502 and S503 are repeated for 
comparison . 

If the two frame are different, comparison restarts 
from one frame after the current frame, which is the k th 
frame (S504 and S505) . It is checked whether k is less 
than j (S5C6) . If the k th frame is before the j th frame, 
and if the i th frame is similar with the k th frame 
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(S511), then the current frame is assigned as the k th 
frame (S512), and the processes in S5C2 and S503 are 
repeated for comparison . 

If the i th frame is different with the k th frame, 
then k increases by 1 (S505) , and it is checked whether k 
is less than j. If k is not less than j, that means the j 
th frame is different with the previous frames, the j th 
_frame is marked as a new candidate text change frame 

(S507) . A new search begins from the j th frame (S508) . If 
the sum of the index i of the current search frame and 
STEP is larger than the total number of input video frames 
nFrame (S509) , then the search is over and the found 
candidate text change frames are sent to the following 
units 302 and 303 for further processing. Otherwise, the 
search is continued. 

The purpose of the frame interval STEP is to reduce 
the total time for the search operation. If STEP is too 
big and the content of video changes rapidly, the 
performance will decrease. If the STEP is too small, the 
total search time will also be not very short. This frame 
interval is chosen as STEP = 4 frames, for example. 

Fig. 11 shows the flowchart of the operation of the 
determination of the similarity of two frames in S503 
shown in Fig. 10. The flowchart of the process in S511 is 
obtained by replacing j with k in Fig. 11. 

At start, the image block count n, the valid block 
count nValid, and the similar block count nSimiiar are all 
set to zero (S5I3) . Then the i th frame and the j th frame 
are divided into non-overlapped small image blocks with 
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size of NxN, and the number of the image blocks is 
recorded as nBlock (S514) . Here N = 16, for example. The 
two image blocks in the same position in the two frames 
are defined as an image block pair. For every image block 
pair, the image block validation unit 311 is used to check 
whether the image block pair is a valid block pair (S515) . 
The detection of the change between two frames is achieved 
. by detecting change in every image block pair. The 
background parts of a slide usually do not change, even if 
the content has been changed. So image block pairs in 
these parts should not considered as valid block pairs. 

If the block pair is invalid, then the next block 
pair is checked (S519 and S520) . If the block pair is a 
valid block pair, the valid block count nValid increases 
by 1 (S516), and the image block similarity measurement 
unit 312 is used to measure the similarity of the two 
image blocks (S517) . If the blocks are similar, the 
similar block count nSimilar increases by 1 (S518) . When 
all the block pairs are compared (S519 and S520), the 
frame similarity judgment unit 313 is used to determine 
whether the two frames are similar (S521) . The two frames 
are considered as similar if the following condition is 
met (S522) : 

nSimilar > nValid * simrate, 

here simrate = 0.85, for example. The two frames are 
considered as dissimilar if the above condition is not met 
(S523) . 
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Fig. 12 shows the flowchart of the operation of the 
image block validation unit 311 in S515 shown in Fig. 11. 
First, the mean and the variance of the n th image block 
pair are calculated (S524) . The means and the variances of 
the gray level of the image block in the i th frame are 
denoted by M(i) and V(i), respectively. The mean ■ and the 
variance of the gray level of the image block in the j th 
... frame are denoted by _M_( j )_ ar.d^V ( j )..,_ respectively. If two 
variances V(i) and V(j) of the block pair are ail smaller 
than a predefined threshold Tv (S525) , and the absolute 
difference of the two means M(i) and M(j) is also smaller 
than a predefined threshold Tm (S526) , then the image 
block pair is an invalid block pair (S527) . Otherwise, the 
image block pair is a valid block pair (S528) . 

Fig. 13 shows the flowchart of the operation of the 
image block similarity measurement unit 312 in S517 shown 
in Fig. 11. The means M(i) and M(j) of the n th image 
block pair is calculated first (S529) . If the absolute 
difference of the two means M(i) and M(j) is larger than a 
predefined threshold Tml (S530) , then the two image blocks 
are considered as dissimilar image blocks (S534) . 
Otherwise, the correlation of the two image blocks C(i, j) 
is calculated (S53I) . If the correlation C(i, j) is larger 
than a predefined threshold Tc (S532) , the two image 
blocks are similar (S533) , and if the correlation is 
smaller than the threshold Tc, the two image blocks are 
dissimilar (S534) . 

Figs. 14 to 21 show some example results of the 
processes performed by the text frame detection and 
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verification unit 302 shown in Fig. 5. Fig. 14 shows the 
original video frame. Fig. 15 shows the first binary image 
resulted from fast and simple image binarization . Fig. 16 
shows the result of horizontal binary projection. Fig. 17 
shows the result of projection regularization . Fig. 18 
shows the result of vertical binary projection in every 
candidate text line. Fig. 19 shows the result of text line 
region - determination . Gray rectangles indicate candidate 
text line regions* 

Fig. 20 shows the result of two pairs of binary 
images for two candidate text line regions marked in 
dashed line in Fig. 19. The first pair binary images 
contain text information. The difference between these two 
images is very small. So this text line region is regarded 
as a true text line region. The second pair cf binary 
images has very big difference. Since the different part 
is larger than a predefined threshold, this region is 
considered as non-text-line region. Fig. 21 shows the 
detected text line regions. 

Figs. 22 and 23 show the flowchart of the operation 
of the text frame detection and verification unit 302 
shewn in Fig. 3. First, continuous candidate frames 
section detection is performed to classify the candidate 
text frames outputted by the frame similarity measurement 
unit 301 into a plurality of sections, each section 
contains a serial of continuous candidate frames (S701). 
The number of the sections is denoted by nSection. Started 
from the first section (S702), if the number of the 
continuous candidate frames M(i) of the i th section is 
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larger than a predefined threshold Tncf (S703) , the fast 
and simple image binarization unit 321 is used to get the 
every binary image of all video frames (S704) . Then the 
text line region determination unit 322 using the 
horizontal and vertical projection of the binary image is 
used to determine the regions of the text lines (S705) . 

Next started from the first detected text line 
region -CS 7 0.6) , the rebinarization unit 323 is used to make 
a second binary image of the text line region (S707) . The 
rebinarization unit 323 uses Niblack' s image binarization 
method on the whole region of every detected text line to 
get the binary image. The two binary images of the same 
text line region are compared by the text line 
confirmation unit 324 (S7C8) . If the two binary images are 
similar, then the similar text line count for the i th 
section nTextLine(i) increases by 1 (S709) . This procedure 
repeat for ail text lines in these M(i) continuous 
candidate frames (S710 and S711) . 

Sometime a non-text frame will be detected as 
containing some text lines, but if a serial of candidate 
frames do not contain any text line, it is unlikely that 
the total number of the text lines detected in these 
frames will be very big. So the text frame verification 
unit 325 is used to confirm whether the serial of 
candidate text frames are non-text frames. The serial of 
the candidate text frames are considered as non-text 
frames if the following condition is met (S712) : 

nTextLine (i) < cxM(i) , 
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and these false candidate text frames are removed (S713) . 
Here, a is a positive real number that is determined by 
experiment. Usually it is set as a = 0.8. The procedure 
repeats for all continuous candidate frames sections (S714 
and S715) . 

Fig. 24 shows the flowchart of the operation of the 
- fast and simple binarization unit 321 in S704 shown in Fig. 
22. The frame image is first divided into non-overlapped 
image blocks with size of NxN, and the number of the image 
blocks is recorded as nBlock (S716) . Here N = 16, for 
example. Started from the first image block (S717), every 
image block is binarized using Niblack's image binariation 
method (S718) . The parameter k for Niblack's image 
binarization is set as k = -0.4. The procedure repeats for 
ail image blocks (S719 and S720) . 

Fig. 25 shows the flowchart of Niblack's image 
binarization method in S718 shown in Fig. 24. The input is 
a gray level image of size MxN. First, the mean Mean and 
the variance var of the image is calculated (S721) . If the 
variance Var is less than a predefined threshold Tv (S722), 
then all pixels in the binary image are set to 0. If Var > 
Tv, a binary threshold T is calculated by the following 
equation : 

T = Mean + k * Var. 

For every image pixel i, if the gray level gray(i) 
for of the pixel is larger than T (S726) , the pixel in the 
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binary image bin(i) is set to 0 (S727) , otherwise, the 
pixel is set to 1 (S728) . The procedure repeats for all 
pixels in the binary image (S729 and S730) . 

Fig. 2 6 shows the flowchart of the operation of the 
text line region determination unit 322 in S705 shown in 
Fig. 22. The input of this unit is the binary image of the 
video frame from S704. The horizontal image projection 
Prjh is first calculated (S731) . The projection is then 
smoothed (S732) and regularized (S733) . The result of the 
regularization of Prjh is Prjhr, which has only two 
values: 0 or 1. 1 means that the position has a large 
projection value, 0 means that the position has a small 
projection value. The start and end points of each l's 
region in the Prjhr are recorded as sy(i) and ey(i), 
respectively (S734) . For each l's region in Prjhr, the 
vertical image projection Prjv(i) is calculated (S735) . 
Prjv(i) is smoothed (S736) and regularized as Prjvr(i) 

(S737) . Two l's regions in Prjvr(i) are connected into one 
region if the distance between the two l's regions is less 
than 2 * region height, and the start and end points of 
the connected region are recorded as sx(i) and ex(i), 
respectively (S738) . The output sx(i), ex(i), sy(i) and 
ey(i) determine the i th region of the text line (S739) . 

Fig. 27 shows the flowchart of horizontal image 
projection in S731 shown in Fig. 26. Started from the 
first horizontal line (S740), the projection for the i th 
horizontal line is calculated by the following equation 

(S741) : 
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where I (i, j) is the pixel value in the i th row and j th 
column and w is the width of the image. The calculation 
repeats for all horizontal lines in the image with h as 
the height of the image (S742 and S743) . 

Fig. 28 shows the flowchart of projection smoothing 
_ _ in S732 shown in Fig. 26. Started from the radii of the 
smoothing window, ft (S744), the value for the i th point 
in the smoothed projection prjs(i) is calculated by the 
following equation (S745) : 

where the length of the smoothing window is 2 * 6 + 1 . The 
calculation repeats for all points in the smoothed 
projection with L as the range for smoothing (S746 and 
S747) . 

Fig. 29 shows the flowchart of projection 
regularization in S733 shown in Fig. 26. At first, all 
local maxima in the projection are detected (S748) . The 
value for every pixel in the regularized projection Prj r 
is set to 0 (S749) . Started from the first local maximum 
max(i) (S750), two nearby local minima mini (i) and min2(i) 
are detected (S751) . 

Fig. 3 0 shows an exemplary drawing of the max(i), 
mini(i) and min2(i) positions in a projection curve. There 
are three local maxima. P2, P4 and P6 are max(l), max (2) 
and max (3), respectively. PI is the upper minimum minl(l) 
for max(l), P3 is the bottom minimum min2(l) for max(l). 
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P3 is also the upper minimum mini (2) for max (2). Similarly, 
P5 is the bottom minimum min2(2) for max(2), and also is 
the upper minimum mini (3) for max (3). P7 is the bottom 
minimum min2(3) for max (3). 

If minl(i) < max(i)/2 and min2(i) < max(i)/2 (3152), 
then the values in the regularized projection Prjr between 
the positions of minl(i) and min2(i) are set to 1 (S753) . 
The procedure repeats for -every- local, maximum (S754 and 
S755) . 

Fig. 31 shows the flowchart of the operation of the 
text line confirmation unit 324 in S708 shown in Fig. 22. 
The input of this unit is two binary images II and 12 with 
size wxh of the same text line region. First the counters 
countl, count2 and count are set to 0 (S756) . count means 
the number of pixels where the value of two corresponding 
pixels in II and 12 are all 1. countl means the number of 
pixels where the value of the corresponding pixel in II is 

I and that in 12 is 0. count2 means the number of pixels 
where the value of the corresponding pixel in 12 is 1 and 
that in II is 0. 

Started from the first position in the two images f 
if corresponding two pixels II (i) and 12 (i) are both .1, 
then count increases by 1 (S757 and S758) . Otherwise, if 

II (i) is 1, then countl increases by 1 (S759 and S760) . 
Otherwise, if 12 (i) is i, then count2 increases by 1 (S761 
and S762) . After all pixels are checked (S763 and S764), 
it is checked whether the following conditions are met 
(S765 and S766) : 
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count + countl < w * h/2, 

count + count2 < w * h/2, 

countl < count * 0.2, 

ccunt2 < count x 0.2, 
fillrate < 0.5. 

The 'fillrate' of a text line region is defined as 
-the -rate of the number of foreground pixels to the number 
of total pixels in the region. If the above conditions are 
met, then two binary images are considered as similar in 
this text line region and the text line region is 
considered as a valid text line (S768) . If one of these 
conditions is not met, the text line region is considered 
as an invalid text line (S767) . 

Figs. 32 and 33 show the flowchart of the operation 
of the image shifting detection unit 303 shown in Fig. 6. 
For two continuous frames, frame i and frame j, first the 
fast and simple image binarization unit 331 is used to get 
the binary image of the two frames (S801) . Then the text 
line vertical position determination unit 332 is used to 
perform the horizontal image projection as in S731 shown 
in Fig. 26 for obtaining the horizontal projections Prjyi 
and Prjyj for frame i and frame j, respectively (S802) . 
The vertical shifting detection unit 333 is then used to 
calculate the correlation function Cy(t) of the two 
projections (S803) . 

Here, a correlation function C (t) of two projections 
Prjl(x) and Prj2(x) is defined as: 
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C(t) = ~^ I(Pq 1 (a) - M l)*(Pij2(A- + 0 - M2) 

Where L is the length of the projection, and Ml and M2 are 
the means cf the projections Prji and Prj2, respectively. 
VI and V2 are the variances of Prjl and Prj2, respectively. 

If the maximum of Cy(t) is less than 9C% (S804), 
then the two images are not shifting images. Otherwise, 
= the position of = the maximum value of Cy (t ) is recorded as 
the vertical offset offy (S805), and the projection 
regularization as in S733 is performed to get the 
regularized projection Prjyir of projection Prjyi (S806) . 
If frame j is a shifting version of frame i, the vertical 
shifting offset of frame j is represented by offy. Every 
1's region in Prjyir is considered as a candidate text 
line region, which is indicated by the start and end 
points syi and eyi (S807) . The number of the candidate 
text line regions is recorded as nCanTL. 

Started from the first candidate text line region, 
the matching count nMatch is set to 0 (S808) . The c th 
corresponding shifting candidate text line region in frame 
j is assumed to be represented by syj (c) = syi (c) + offy 
and eyj (c) = eyi (c) + offy (S809) . For two corresponding 
candidate text line regions, the vertical projections are 
calculated (S810) . Then the horizontal shifting detection 
unit 334 is used to calculate the correlation function 
Cx(t) for the two vertical projections is calculated, and 
the position of the maximum value of Cx(t) is recorded as 
the horizontal offset offx, for these two projections 
(S811) . If the maximum of Cx(t) is larger than 90% (S812), 
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the two candidate text line regions are considered as 
matched shifting text line regions and the matching count 
nMatch increases by 1 (S813) . After every candidate text 
line pair are checked (S814 and S815) , if the number of 
the matched shifting text line regions is larger than 70% 
of the number of candidate text line regions (S816) , frame 
j is regarded as a shifting version of frame i (S817) . 
- Otherwise, frame j is not. a shifting, frame, of frame i 
(S818) . 

Fig. 34 shows the configuration of the text 
extraction apparatus 105 shown in Fig. 1. The text 
extraction apparatus comprises an edge image generation 
unit 901 for extracting the edge information of the video 
frame, a stroke image generation unit 902 using the edge 
image for generating the stroke image of the candidate 
character strokes, a stroke filtering unit 903 for 
removing false character strokes, a text line region 
formation unit 904 for connecting nearby strokes into a 
text line region, a text line verification unit 905 for 
delete false character stroke in the text line region, and 
a text line binarization unit 906 for obtaining the final 
binary image of the text line region. The output of the 
text extraction apparatus is a list of binary images for 
all text line regions in the frame. According to the text 
extraction apparatus 105, the text line region can be 
accurately binarized since the false strokes are detected 
and removed as much as possible. 

Fig. 3 5 shows the configuration of the edge image 
generation unit 901 shown in Fig. 34. The edge image 
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generation unit 901 includes an edge strength calculation 
unit 911, a first edge image generation unit 912, and a 
second edge image generation unit 913. The edge strength 
calculation unit 911 calculates edge strength for every 
pixel in a video frame by using a Sobei edge detector. The 
first edge image generation unit 912 generates the first 
edge image by comparing the edge strength of every pixel 
- - with a predefined- edge, threshold, and sets a value of a 
corresponding pixel in the first edge image to one binary- 
value if the edge strength is . greater than the threshold 
and the ether binary value if the edge strength is less 
than the threshold. For example, logic "1" is used as the 
one binary value, which may indicate a white pixel, and 
logic "Q" is used as the other binary value, which may 
indicate a black pixel. The second edge image generation 
unit 913 generates the second edge image by comparing the 
edge strength of every pixel in a window centered at the 
position of every pixel of the one binary value in the 
first edge image with mean edge strength of the pixels in 
the window, and sets a value of a corresponding pixel in 
the second edge image to the one binary value if the edge 
strength of the pixel is greater than the mean edge 
strength and the other binary value if the edge strength 
of the pixel is less than the mean edge strength. A small 
window of size of 3x3, for example, is used for the second 
edge image generation. 

Fig. 36 shows the configuration of the stroke image 
generation unit 902 shown in Fig. 34. The stroke image 
generation unit 902 includes a local image binarization 
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unit 921. The local image binarization unit 921 binarizes 
a gray scale image of the video frame in the Niblack's 
binarization method to obtain a binary image of candidate 
character strokes by using a window centered at the 
position of every pixel of the one binary value in the 
second edge image, A window of size of 11x11, for example, 
is used for the local image binarization. 

- • Fig. 37 shows the configuration of ..the stroke 
filtering unit 903 shown in Fig. 34. The stroke filtering 
unit 903 includes a stroke edge coverage validation unit 
931 and a long straight line detection unit 932. The 
stroke edge coverage validation unit 931 checks an overlap 
rate of a contour of a stroke in the binary image of the 
candidate character strokes by pixels of the one binary 
value in the second edge image, determines that the stroke 
is a valid stroke if the overlap rate is greater than a 
predefined threshold and an invalid stroke if the overlap 
rate is less than the predefined threshold, and removes 
the invalid stroke as a false stroke. The long straight 
line detection unit 932 removes a very large stroke as a 
false stroke by using s width and a height of the stroke. 
According to the stroke filtering unit 903 f false strokes 
unnecessary for a text line region are detected and 
removed from the binary image of the candidate character 
strokes . 

Fig. 38 shows the configuration of the text line 
region formation unit 904 shown in Fig. 34. The text line 
region formation unit 9C4 includes a stroke connection 
checking unit 941. The stroke connection checking unit 941 
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checks whether two adjacent strokes are ccnnectable by 
using an overlap ratio of heights of the two strokes and a 
distance between the two strokes. The text line region 
formation unit 904 combines strokes into a text line 
region by using the result of the checking. 

Fig. 39 shows the configuration of the text line 
verification unit 905 shown in Fig. 34. The text line 
verification unit 905 includes a vertical raise stroke 
detection unit 951, a horizontal false stroke detection 
unit 952 f and a text line reformation unit 953. The 
vertical false stroke detection unit 951 checks every 
stroke with a height higher than the mean height of 
strokes in the text line region, and marks the stroke as a 
false stroke if the stroke connects two horizontal text 
line regions into one big text line region. The horizontal 
false stroke detection unit 952 checks every stroke with a 
width larger than a threshold determined by the mean width 
of the strokes in the text line region, and marks the 
stroke as a false stroke if the number of strokes in a 
region that contains the stroke is less than a predefined 
threshold. The text line reformation unit 953 reconnects 
strokes except for a false stroke in the text line region 
if the false stroke is detected in the text line region. 
According to the text line verification unit 905, false 
strokes are further detected and removed from the text 
line region. 

Fig. 40 shows the configuration of the text line 
binarization unit 906 shown in Fig. 34. The text line 
binarization unit 906 includes an automatic size 
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calculation unit 961 and a block image binarization unit 
962. The automatic size calculation unit S61 determines a 
size of a window for binarization. The block image 
binarization unit 962 binarizes a gray scale image of the 
video frame in the Niblack' s binarization method to obtain 
a binary image of a text line region by using the window 
centered at the position of every pixel of the one binary 
value in the second edge image. According to such text 
line binarization after removing the false strokes, the 
text line region can be accurately binarized. 

Figs. 41 to 4 6 show some results of the text 
extraction apparatus. Fig. 41 shows the original video 
frame. Fig. 42 shows the result for edge image generation, 
which is the final edge image (second edge image). Fig. 43 
shows the result of stroke generation. Fig. 44 shows the 
result of stroke filtering. Fig. 45 shows the result of 
text line formation. Fig. 4 6 shows the result of the 
refined final binarized text line regions. 

Figs. 47 and 48 show the flowchart of the operation 
of the edge image generation unit 901 shown in Fig. 35. 
First all the values of the pixels Edgelmgl (i) in the 
first edge image Edgelmgl of size WxH are set to 0 (S1101) . 
Started from the first pixel (S1102) , the edge strength 
calculation unit 911 is then used to calculate edge 
strength E (i) of the i th pixel using Sobel edge detector 

(51103) . Next the first edge image generation unit 912 is 
used to determine the value of Edgelmgl (i). If the edge 
strength is larger than a predefined threshold Tedge 

(51104) , then the value of this pixel in the first edge 
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image is set to I, Edgelmgl (i) - 1 (S1105) . This procedure 
continues until all the pixels are checked (S1106 and 
S1107) . 

After the first edge image is obtained, ail the 
values Edgelmg2(i) for the second edge image Edgelmg2 of 
size WxH are initialized to 0 (S1108) . Scanned from the 
first pixel (S1109), if the value of the pixel in the 
first edge image is 1 J SI 110) , then the mean edge strength 
of the neighborhood pixels is obtained according to the 
arrangement of the neighborhood 1116 of pixel i shown in 
Fig. 49 (S11I1) . The second edge image generation unit 913 
is then used to determine the values for these 
neighborhood pixels in the second edge image by comparing 
the edge strength of a pixel with the mean edge strength 
(S1112) . If the edge strength is larger than the mean edge 
strength, the pixel value in the second edge image is set 
to 1, otherwise the value is set to 0. After ail pixels in 
the first edge image are checked (S1113 and S1IT4) , the 
second edge image is outputted as the final edge image 
Edgelmg (S1115) . 

Fig. 50 shows the flowchart of the operation of the 
edge strength calculation unit 911 in S11C3 shewn in Fig. 
47. For the i th pixel, the horizontal and the vertical 
edge strengths Ex(i) and Sy(i) are first obtained in the 
neighborhood area 1116 shown in Fig. 49 by the following 
equations (S1117 and S1118): 

Sx(i) = i(d) + 2*1 (e) + 1(f) - 1(b) - 2*1 (a) - 1(h), 
Sy(i) = 1(b) + 2*I(c) + 1(d) - 1(h) - 2*I(g) - 1(f), 
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where I (x) represents the gray level of the x th pixel <x 
= a, b, c, d, e, f, g, h) . The total edge strength E(i) is 
calculated by the following equation (S1119) : 

E(i) = V&<0 * &(0 + £y(J) * • 

The mean .edge strength of pixel i in SI 111 shown in 
Fig. 48 is calculated by the following equation: 

Medge(i) - (E(a) + E(b) + E(c) + E (d) + 'E(e) + E(f) + E (g) 
+ E(h) + E (i) ) / 9. 

Fig. 51 shows the flowchart of the operation of the 
stroke image generation unit 902 shown in Fig. 36. The 
stroke image of size WxH is first initialized to 0 (S1201) . 
Then the local image binarization unit 921 is used to 
determine the values of the pixels of the stroke image. 
Started from the first pixel (S1202), if the value of the 
i th pixel Edgelmg(i) in the edge image Edgelmg is 1 
(S1203) , a 11x11 window is set at the gray level frame 
image centered at the pixel's position and the values of 
the pixels of the stroke image in the window are 
determined by Niblack's binarization method shown in Fig. 
25 (S1204) . After all pixels are checked in the edge image 
(S1205 and S1206) , the stroke image is generated. 

Fig. 52 shows the flowchart of the operation of the 
stroke filtering unit 903 shown in Fig. 37. First the long 
straight line detection unit 932 is used to delete very 
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large strokes. Started from the first stroke (S1301) , if 
the width or height of the stroke exceeds a predefined 
threshold MAXSTROKESIZE (S1302) , this stroke is deleted 
(S1304) . Otherwise, the stroke edge coverage validation 
unit 931 is used to check the validity of the stroke 
(S1303) . A valid stroke means a candidate character stroke 
and an invalid stroke is not a true character stroke. If 
the stroke is invalid, it is_ deleted (S1304). The checking 
is repeated for ail strokes found in the stroke image with 
nStroke as the number of the strokes (S1305 and S1306) . 

Fig. 53 shows the flowchart of the operation of the 
stroke edge coverage validation unit 931 in S1303 shown in 
Fig. 52. First the contour C of the stroke is obtained 
(S1307) . From the first contour point (S1308) , the pixel 
values of Edgelmg in the neighborhood area of the current 
contour point are checked (S1309) . As shown in Fig. 49, 
point a to point h are considered as the neighborhood 
points of point i. If there is a neighbor edge pixel which 
has a value of 1, then this contour point is regarded as a 
valid edge contour point and the count of valid edge 
contour points nEdge increases by 1 (S1310) . After all 
contour points are checked with nContour as the number of 
the contour points (S1311 and S1312) , if the number of the 
valid edge contour points is larger than 0.8*nContour 
(S1313) , the stroke is considered as a valid stroke, that 
is, a candidate character stroke (S1314) . Otherwise, the 
stroke is an invalid stroke (S1315) . An invalid stroke is 
deleted from the stroke list. The rate of nEdge to 
nContour in SI 313 represents the overlap rate. 
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Fie. 54 shows the flowchart of the operation of the 
text line region formation unit 904 shown in Fig. 38. 
First the region of every stroke is set as an individual 
text line region and the number of text line nTL is set to 
nStroke (S14G1) . Started from the first stroke (S1402) , 
stroke j next tc stroke i is selected (S1403) and it is 
checked whether stroke i and stroke j belong to one text 
— line region (S1404) . If not, the stroke connection 

checking unit 941 is used to check whether these two 
strokes are ccnnectable (S1405) . If so, all the strokes in 
these two text lines, a text line to which stroke i 
belongs and a text line to which stroke j belongs, are 
combined into one big text line (S1406) and the number of 
text line decreases by 1 <S1407) . 

Here, a text line is a group of ccnnectable strokes 
and every stroke has an attribute of a text line. If 
stroke i belongs to the m th text line, stroke j belongs 
to the n th text line, and stroke i is ccnnectable with 
stroke j, then the attributes of all strokes in the m th 
and the n th text lines are set to m. After every pair of 
the strokes are checked (S1408, S1409, S1410 and S1411) , 
nTL is the number of the text lines in the frame. 

Fig. 55 shows the flowchart of the operation of the 
stroke connection checking unit 941 in S1405 shown in Fig. 
54. First, the heights of the two strokes hi and h2 are 
obtained and the higher height is marked as maxh and the 
lower height is marked as minh (S1412) . If the horizontal 
distance between the centers of stroke i and stroke j is 
larger than I.5*maxh (S1413) , then these two strokes are 
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not connectable (S1417) . Otherwise, the number of the 
horizontal lines that has intersection with both stroke i 
and stroke j is recorded as nOverlap (S1414). If nOverlap 
is larger than 0.5*minh (S1415) , then these two strokes 
are connectable (S14I6) . Otherwise, these two strokes are 
not connectable (S1417) . The ratio of nOverlap to minh in 
S1415 represents the overlap ratio. 
- -.=. . Fig. 56 shows the flowchart of the operation of the 

text line verification unit 905 shown in Fig. 39. First, 
the modification flag modflag is set to false (S1501) . 
Started from the first text line region (S1502), if the 
height of the i th text line region Height (i) is less than 
a predefined threshold MINTLHEIGHT (S1503) , this text line 
region is deleted (S1504). Otherwise, a vertical false 
stroke detection unit 951 and a horizontal false stroke 
detection unit 952 are used to detect a false stroke 
(S1505 and S1506) . If a false stroke is detected, then the 
stroke is deleted (S1507) , the remaining strokes are 
reconnected (S1508) using the text line reformation unit 
953, and the modification flag is set to true (S1509) . The 
text line reformation unit 953 reconnects the remaining 
strokes in the same manner as the text line region 
formation unit 904. After all the text line regions are 
checked (S151C and S1511), if the modification flag is 
true (S1512), then the whole process is repeated again 
until no false stroke is detected. 

Fig. 57 shows the flowchart of the operation of the 
vertical false stroke detection unit 951 in S1505 shown in 
Fig. 56. The mean height of the strokes in the text line 
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region is first calculated (S1513) . Started from the first 
stroke (S1514) , if the height of stroke i is larger than 
the mean height (S1515) , then multiple text line detection 
is performed to check the strokes in an area to the left 
of stroke i (S1516) . The area to the left of stroke i is a 
region inside a text line region, and the left, up, and 
bottom boundaries of this area are the left, up and bottom 
boundaries, respectively, of the text line region. The 
right boundary of this area is the left boundary of stroke 
i. If there are two or more non-overlapped horizontal text 
line regions in the area to the left of stroke i, stroke i 
is a vertical false stroke (S1520) . 

Otherwise, multiple text line detection is then 
performed to check the strokes in an area to the right of 
stroke i (S1517) . The area to the right of stroke i has a 
similar definition to that of the area to the left of 
stroke i. If there are two or more non-overlapped 
horizontal text line regions in the area to the right of 
stroke i, stroke i is a vertical false stroke (S1520) . The 
procedure repeats until every stroke in the text line 
region is checked (S1518 and S1519) . 

Fig. 58 shows the flowchart of multiple text line 
detection in S1516 and S1517 shown in Fig. 57. First, the 
strokes are connected in the same manner as the text 
region formation unit 904 (S1521). if the number of the 
text line regions nTextLine is more than 1 (SI 522) , then 
it is checked whether the following three conditions are 
met . 

1. There are two non-overlapped text line regions (S1523) . 
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2. One text line region is above the other text line 
region (S1524) . 

3. Number of the strokes in each text line region is 
larger than 3 (S1525) . 

If ail the three conditions are met, then multiple text 
lines are detected (S1526) . 

Fig. 59 shows the flowchart of the operation of the 
horizontal false stroke detection unit 952 in S15C6 shown 
in Fig. 56. First, the mean width of all the strokes in 
the text line region is calculated (S1527) . Started from 
the first stroke (S1528), if the width of stroke i is 
larger than 2.5 times the mean stroke width (S1529) , then 
a detection region R is set (S1530) . The left boundary 
R.ieft and the right boundary R. right of R are determined 
by the left boundary Stroke (i) .Left and the right boundary 
Stroke (i) .Right, respectively, of stroke i. The top 
boundary R.top and the bottom boundary R. bottom of R are 
determined by the top boundary textline.top and the bottom 
boundary textline .bottom, respectively, of the text line 
region. The number of strokes in detection region R is 
calculated (S1531) , if the number is less than or equal to 
3 (S1532) , then stroke i is marked as a horizontal false 
stroke (S1533) . The procedure repeats until every stroke 
in the text line region is checked (S1534 and S1535) . 

Figs. 60 and 61 show examples of a false stroke. 
Stroke 1541 shown in Fig. 60 is a vertical false stroke 
and stroke 1542 shown in Fig. 61 is a horizontal false 
stroke . 

Fig. 62 shows the flowchart of the operation of the 



tbfiE#2 003-3090803 



2002-378577 



^-v: 43/ 



text line binarization unit 906 shown in Fig. 40. First, 
the automatic size calculation unit 961 is used to 
determine the size of the window wh for binarization based 
on the height of the text line region Height (S1601), 
which must satisfy the following three conditions: 

wh = Height / 3, 

wh = wh + 1 if wh is S_ n even number, 
wh = 5 if wh < 5. 

After that/ the block image binarization unit 962 is used 
to rebinarize the text line region (S1602) . The block 
image binarization unit 962 sets the window size of 
Nibiack's binarization method to wh and rebinarizes the 
text line region in the same manner as the stroke image 
generation unit 902. 

The video text processing apparatus or each of the 
text change frame detection apparatus 104 and the text 
extraction apparatus 105 shown in Fig. 1 is configured, 
for example, using an information processing apparatus 
(computer) as shown in Fig. 63. The information processing 
apparatus shown in Fig. 63 comprises a CPU (central 
processing device) 1701, a memory 1702, an input device 
1703, an output device 1704, an external storage device 
1705, a medium drive device 1706, a network connection 
device 1707, and a video input device 1708. They are 
interconnected through a bus 1709. 

The memory 1702 includes, for example, ROM (read 
only memory), RAM (random access memory), etc. and stores 
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programs and data for use in the processes. The CPU 1701 
performs a necessary process by executing the program 
using the memory 1702. In this case, the units 301 to 303 
shown in Fig. 3 and the units 901 to 906 shown in Fig. 34 
correspond to the programs stored in the memory 1702. 

The input device 1703 is, for example, a keyboard, a 
pointing device, a touch panel, etc., and used to input an 
instruction and- information .from a user . . ... The output 
device 1704 is, for example, a display, a printer, a 
speaker, etc., and used to output an inquiry to the user 
and a process result. 

The external storage device 1705 is, for example, a 
magnetic disk device, an optical disk device, a magneto- 
optical disk device, a tape device, etc. The information 
processing apparatus stores the programs and data in the 
external storage device 1705, and loads them to the memory 
1702 to use them as necessary. The external storage device 
1705 is also used as a database storing the existing video 
data 101 shown in Fig. 1. 

The medium drive device 1706 drives a portable 
storage medium 1710, and accesses the stored contents. 
The portable storage medium 1710 is an arbitrary computer- 
readable storage medium such as a memory card, a flexible 
disk, CD-ROM (compact disk read only memory) , an optical 
disk, a magneto-optical disk, etc. The user stores the 
programs and data in the portable storage medium 1710, and 
loads them to the memory 1702 to use them as necessary. 

The network connection device 1707 is connected to 
an arbitrary communications network such as a LAN (local 
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area network) , Internet, etc., and converts data during 
the communications. The information processing apparatus 
receives the programs and data through the network 
connection device 1707, loads them to the memory 1702 to 
use them as necessary. 

The video input device 1708 is, for example, the TV 
video camera 102 shown in Fig. 1 and is used to input the 
living video stream. .... .. 

Fig. 64 shows computer-readable storage media 
capable of providing a program and data for the 
information processing apparatus shown in Fig. 63. The 
program and data stored in the portable storage medium 
171C and a database 1803 of a server 18C1 are loaded to 
the memory 1702 of an information processing apparatus 
1802. The server 1801 generates a propagation signal for 
propagating the program and data, and transmits it to the 
information processing apparatus 1802 through an arbitrary 
transmission medium in a network. The CPU 1701 executes 
the program using the data to perform a necessary process, 
(addition 1) A text change frame detection apparatus 

that selects a plurality of video frames including text 
contents from given video frames, characterized in that 
said apparatus comprises: 

first frame removing means for removing redundant 
video frames from the given video frames; 

second frame removing means for removing video 
frames that do not contain a text area from the given 
video frames; 

third frame removing means for detecting and 
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removing redundant video frames caused by image shifting 
from the given video frames; and 

output means for outputting remaining video frames 
as candidate text change frames. 

(addition 2) The text change frame detection 

apparatus according to addition 1, characterized in that 
the first frame removing means includes: 

image block validation means for determining whether 
two image blocks in the same position in two video frames 
of the given video frames are a valid block pair that has 
an ability to show a change of image contents; 

image block similarity measurement means for 
calculating a similarity of two image blocks of the valid 
block pair and determining whether the two image blocks 
are similar; and 

frame similarity judgment means for determining 
whether the two video frames are similar by using a ratio 
of a number of similar image blocks to a total number of 
valid block pairs, 

and the first frame removing means removes a similar video 
frame as a redundant video frame. 

(addition 3) The text change frame detection 

apparatus according to addition 1, characterized in that 
the second frame removing means includes: 

fast and simple image binarization means fcr 
generating a first binary image of a video frame of the 
given video frames; 

text line region determination means for determining 
a position of a text line region by using a horizontal 
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projection and a vertical projection of the first binary 
image; 

rebinarization means for generating a second binary 
image of every text line region; 

text line confirmation means for determining 
validity of a text line region by using a difference 
between the first binary image and the second binary image 
and a fill- rate of a number of foreground pixels in the 
text line region to a total number of pixels in the text 
line region; and 

text frame verification means for confirming whether 
a set of continuous video frames are non-text frames that 
do not contain a text area by using a number of valid text 
line regions in the set of continuous video frames, 
(addition 4) The text change frame detection 

apparatus according to addition 1, characterized in that 
the third frame removing means includes: 

fast and simple image binarization means for 
generating binary images of two video frames of the given 
video frames; 

text line vertical position determination means for 
determining a vertical position of every text line region 
by using horizontal projections of the binary images of 
the two video frames; 

vertical shifting detection means for determining a 
vertical offset of image shifting between the two video 
frames and a similarity of the two video frames in a 
vertical direction by using correlation between the 
horizontal projections; and 
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horizontal shifting detection means for determining 
a horizontal offset of the image shifting and a similarity 
of the two video frames in a horizontal direction by using 
correlation between vertical projections of every text 
line in the binary images of the two video frames, 
and the third frame removing means removes a similar video 
frame as a redundant video frame caused by the image 
shifting . ... . . _ .. .- 

(addition 5) A text change frame detection apparatus 

that selects a plurality of video frames including text 
contents from given video frames, characterized in that 
said apparatus comprises: 

image block validation means for determining whether 
two image blocks in the same position in two video frames 
of given video frames are a valid block pair that has an 
ability to show a change of image contents; 

image block similarity measurement means for 
calculating a similarity of two image blocks of the valid 
block pair and determining whether the two image blocks 
are similar; 

frame similarity judgment means for determining 
whether the two video frames are similar by using a ratio 
of a number of similar image blocks to a total number of 
valid block pairs; and 

output means for outputting remaining video frames 
after a similar video frame is removed, as candidate text 
change frames. 

(addition 6) A text change frame detection apparatus 

that selects a plurality of video frames including text 
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contents from given video frames, characterized in that 
said apparatus comprises: 

fast and simple image binarization means for 
generating a first binary image of a video frame of the 
given video frames; 

text line region determination means for determining 
a position of a text line region by using a horizontal 
- ----- —projection and. a vertical, project ion of the first binary 

image ; 

rebinarization means for generating a second binary 
image of every text line region; 

text line confirmation means for determining 
validity of a text line region by using a difference 
between the first binary image and the second binary image 
and a fill rate of a number of foreground pixels in the 
text line region to a total number of pixels in the text 
line region; 

text frame verification means for confirming whether 
a set of continuous video frames are non-text frames that 
do not contain a text area by using a number of valid text 
line regions in the set of continuous video frames; and 

output means for outputting remaining video frames 
after the non-text frames are removed, as candidate text 
change frames. 

(addition 7) A text change frame detection apparatus 

that selects a plurality of video frames including text 
contents from given video frames, characterized in that 
said apparatus comprises: 

fast and simple image binarization means for 
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generating binary images of two video frames of the given 
video frames; 

text line vertical position determination means for 
determining a vertical position of every text line region 
by using horizontal projections of the binary images of 
the two video frames; 

vertical shifting detection means for determining a 
-vertical offset of image ^shifting between the two video 
frames and a similarity of the two video frames in a 
vertical direction by using correlation between the 
horizontal projections; 

horizontal shifting detection means for determining 
a horizontal offset of the image shifting and a similarity 
of the two video frames in a horizontal direction by using 
correlation between vertical projections of every text 
line in the binary images of the two video frames; and 

output means for outputting remaining video frames 
after a similar video frame is removed, as candidate text 
change frames. 

(addition 8) A text extraction apparatus that 

extracts at least one text line region from a given image, 
characterized in that said apparatus comprises: 

edge image generation means for generating edge 
information of the given image; 

stroke image generation means for generating a 
binary image of candidate character strokes in the given 
image by using the edge information; 

stroke filtering means f.or removing a false stroke 
from the binary image by using the edge information; 
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text line region formation means for combining a 
plurality of strokes into a text line region; 

text line verification means for removing a false 
character stroke from the text line region and reforming 
the text line region; 

text line binarization means for binarizing the text 
line region by using a height of the text line region; and 

output means for. outp.ut.ting a binary image of the 
text line region, 

(addition 9) The text extraction apparatus according 

to addition 8, characterized in that the edge image 
generation means includes: 

edge strength calculation means for calculating edge 
strength for every pixel in the given image by using a 
Sobel edge detector; 

first edge image generation means for generating a 
first edge image by comparing the edge strength of every 
pixel with a predefined edge threshold and setting a value 
of a corresponding pixel in the first edge image to one 
binary value if the edge strength is greater than the 
threshold and the other binary value if the edge strength 
is less than the threshold; and 

second edge image generation means for generating a 
second edge image by comparing the edge strength of every 
pixel in a window centered at a position of every pixel of 
the one binary value in the first edge image with mean 
edge strength of the pixels in the windov; and setting a 
value of a corresponding pixel in the second edge image to 
the one binary value if the edge strength of the pixel is 
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greater than the mean edge strength and the other binary 
value if the edge strength of the pixel is less than the 
mean edge strength. 

(addition 10) The text extraction apparatus according 

to addition 3, characterized in that the stroke image 
generation means includes a local image binarization means 
for binarizing a gray scale image of the given image in a 
Niblack' s binarization method to obtain the binary image 
of the candidate character strokes by using a window 
centered at a position of every pixel of the one binary 
value in the second edge image. 

(addition 11) The text extraction apparatus according 

to addition 9, characterized in that the stroke filtering 
means i ncludes : 

stroke edge coverage validation means for checking 
an overlap rate of a contour of a stroke in the binary 
image of the candidate character strokes by pixels of the 
one binary value in the second edge image, determining 
that the stroke is a valid stroke if the overlap rate is 
greater than a predefined threshold and an invalid stroke 
if the overlap rate is less than the predefined threshold, 
and removing the invalid stroke; and 

long straight line detection means for removing a 
large stroke by using a width and a height of the stroke, 
(addition 12) The text extraction apparatus according 

to addition 9, characterized in that the text line 
binarization means includes: 

automatic size calculation means for determining a 
size of a window for binarization; and 
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block image binarization means for binarizing a gray 
scale image of the given image in a Nibiack' s binarization 
method to obtain the binary image of the text line region 
by using the window centered at a position of every pixel 
of the one binary value in the second edge image, 
(addition 13) The text extraction apparatus according 

to addition 8, characterized in that the text line region 
formation means includes a stroke connection checking 
means for checking whether two adjacent strokes are 
connectable by using an overlap ratio of heights of the 
two strokes and a distance between the two strokes, and 
the text line region formation means combines the 
plurality of strokes into a text line region by using a 
result of checking. 

(addition 14) The text extraction apparatus according 

to addition 8, characterized in that the text line 
verification means includes: 

vertical false stroke detection means for checking 
every stroke with a height higher than a mean height of 
strokes in the text line region, and marking the stroke as 
a false stroke if the stroke connects two horizontal text 
line regions into one big text line region; 

horizontal false stroke detection means for checking 
every stroke with a width larger than a threshold 
determined by a mean width of the strokes in the text line 
region, and marking the stroke as a false stroke if a 
number of strokes in a region that contains the stroke is 
less than a predefined threshold; and 

text line reformation means for reconnecting strokes 
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except for a false stroke in the text line region if the 
false stroke is detected in the text line region, 
(addition 15) A text extraction apparatus that 

extracts at least one text line region from a given image, 
characterized in that said apparatus comprises: 

edge image generation means for generating an edge 
image of the given image; 

stroke _ima ge generation^ means fo r generating a 

binary image of candidate character strokes in the given 
image by using the edge image; 

stroke filtering means for checking an overlap rate 
of a contour of a stroke in the binary image of the 
candidate character strokes by pixels indicating an edge 
in the edge image, determining that the stroke is a valid 
stroke if the overlap rate is greater than a predefined 
threshold and an invalid stroke if the overlap rate is 
less than the predefined threshold, and removing the 
invalid stroke; and 

output means for outputting information of remaining 
strokes in the binary image of the candidate character 
strokes . 

(addition 16) A program for a computer that selects a 

plurality of video frames including text contents from 
given video frames, characterized in that the program 
directs the computer to perform a process comprising: 

removing redundant video frames from the given video 
frames ; 

removing video frames that do not contain a text 
area from the given video frames; 
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detecting and removing redundant video frames caused 
by image shifting from the given video frames; and 

outputting remaining video frames as candidate text 
change frames. 

(addition 17) The program according to addition 16, 

characterized in that the program directs the computer to 
perform a process comprising: 

determining whether two image blocks in the same 
position in two video frames of the given video frames are 
a valid block pair that has an ability to show a change of 
image contents; 

calculating a similarity of two image blocks of the 
valid block pair and determining whether the two image 
blocks are similar; 

determining whether the two video frames are similar 
by using a ratio of a number of similar image blocks to a 
total number of valid block pairs; and 

removing a similar video frame as a redundant video 

frame . 

(addition 18) The program according to addition 16, 

characterized in that the program directs the computer to 
perform a process comprising: 

generating a first binary image of a video frame of 
the given video frames; 

determining a position of a text line region by 
using a horizontal projection and a vertical projection of 
the first binary image; 

generating a second binary image of every text line 
region; 



mtiE#2 003-3090803 



mi 2002-378577 



^-v: 56/ 



determining validity of a text line region by using 
a difference between the first binary image and the second 
binary image and a fill rate of a number of foreground 
pixels in the text line region to a total number of pixels 
in the text line region; and 

confirming whether a set of continuous video frames 
are ncn-text frames that do not contain a text area by 
using a number of valid text line regions in the set of 
continuous video frames. 

(addition 19) The program according to addition 16, 

characterized in that the program directs the computer to 
perform a process comprising: 

generating binary images of two video frames of the 
given video frames; 

determining a vertical position of every text line 
region by using horizontal projections of the binary 
images of the two video frames; 

determining a vertical offset of image shifting 
between the two video frames and a similarity of the two 
video frames in a vertical direction by using correlation 
between the horizontal projections; and 

determining a horizontal offset of the image 
shifting and a similarity of the two video frames in a 
horizontal direction by using correlation between vertical 
projections of every text line in the binary images of the 
two video frames, 

and the detecting and removing redundant video frames 
removes a similar video frame as a redundant video frame 
caused by the image shifting. 
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(addition 20) A program for a computer that selects a 

plurality of video frames including text contents from 
given video frames, characterized in that the program 
directs the computer to perform a process comprising: 

determining whether two image blocks in the same 
position in two video frames of given video frames are a 
valid block pair that has an ability to show a change of 
image contents; 

calculating a similarity of two image blocks of the 
valid block pair and determining whether the two image 
blocks are similar; 

determining whether the two video frames are similar 
by using a ratio of a number of similar image blocks to a 
total number of valid block pairs; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames, 
(addition 21) A program for a computer that selects a 

plurality of video frames including text contents from 
given video frames, characterized in that the program 
directs the computer to perform a process comprising: 

generating a first binary image of a video frame of 
the given video frames; 

determining a position of a text line region by 
using a horizontal projection and a vertical projection of 
the first binary image; 

generating a second binary image of every text line 
region; 

determining validity of a text line region by using 
a difference between the first binary image and the second 



tB!I#2 003-3090803 



JRfK 2002-378577 



^-v: 58/ 



binary image and a fill rate of a number of foreground 
pixels in the text line region to a total number of pixels 
in the text line region; 

confirming whether a set of continuous video frames 
are non-text frames that do not contain a text area by 
using a number of valid text line regions in the set of 
continuous video frames; and 

- — outputting remaining video frames after the non-text 
frames are removed, as candidate text change frames, 
(addition 22) A program for a computer that selects a 

plurality of video frames including text contents from 
given video frames, characterized in that the program 
directs the computer to perform a process comprising: 

generating binary images of two video frames of the 
given video frames; 

determining a vertical position of every text line 
region by using horizontal projections of the binary 
images of the two video frames; 

determining a vertical offset of image shifting 
between the two video frames and a similarity of the two 
video frames in a vertical direction by using correlation 
between the horizontal projections; 

determining a horizontal offset of the image 
shifting and a similarity of the two video frames in a 
horizontal direction by using correlation between vertical 
projections of every text line in the binary images of the 
two video frames; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames. 
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(addition 23) A program for a computer that extracts 

at least one text line region from a given image, 
characterized in that the program directs the computer to 
perform a process comprising: 

generating edge information of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge information; 

- removing a false stroke from the binary image, by 
using the edge information; 

combining a plurality of strokes into a text line 
region; 

removing a false character stroke from the text line 
region and reforming the text line region; 

binarizing the text line region by using a height of 
the text line region; and 

outputting a binary image of the text line region, 
(addition 24) The program according to addition 23, 

characterized in that the program directs the computer to 
perform a process comprising: 

calculating edge strength for every pixel in the 
given image by using a Sobel edge detector; 

generating a first edge image by comparing the edge 
strength of every pixel with a predefined edge threshold 
and setting a value of a corresponding pixel in the first 
edge image to one binary value if the edge strength is 
greater than the threshold and the other binary value if 
the edge strength is less than the threshold; and 

generating a second edge image by comparing the edge 
strength of every pixel in a window centered at a position 
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of every pixel of the one binary value in the first edge 
image with mean edge strength of the pixels in the window 
and setting a value of a corresponding pixel in the second 
edge image tc the one binary value if the edge strength of 
the pixel is greater than the mean edge strength and the 
other binary value if the edge strength of the pixel is 
less than the mean edge strength. 

(addition 25) _ The program, according to addition 24, 
characterized in that the program directs the computer to 
perform a process of binarizing a gray scale image of the 
given image in a Niblack' s binarization method to obtain 
the binary image of the candidate character strokes by 
using a window centered at a position of every pixel of 
the one binary value in the second edge image, 
(addition 26) The program according to addition 24, 

characterized in that the program directs the computer to 
perform a process comprising: 

removing a large stroke by using a width and a 
height of the stroke; 

checking an overlap rate of a contour of a stroke in 
the binary image of the candidate character strokes by 
pixels of the one binary value in the second edge image; 

determining that the stroke is a valid stroke if the 
overlap rate is greater than a predefined threshold and an 
invalid stroke if the overlap rate is less than the 
predefined threshold; and 

removing the invalid stroke, 
(addition 27) The program according to addition 24, 

characterized in that the program directs the computer tc 
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perform a process comprising: 

determining a size of a window for binarization; and 
binarizing a gray scale image of the given image in 
a Niblack' s binarization method to obtain the binary image 
of the text line region by using the window centered at a 
position of every pixel of the one binary value in the 
second edge image. 

(addition 28) - The program, according to addition 23, 
characterized in that the program directs the computer to 
perform a process comprising: 

checking whether two adjacent strokes are 
connectable by using an overlap ratio of heights of the 
two strokes and a distance between the two strokes; and 

combining the plurality of strokes into a text line 
region by using a result of checking. 

(addition 29) The program according to addition 23, 

characterized in that the program directs the computer to 
perform a process comprising: 

checking every stroke with a height higher than a 
mean height of strokes in the text line region; 

marking the stroke as a false stroke if the stroke 
connects two horizontal text line regions into one big 
text line region; 

checking every stroke with a width larger than a 
threshold determined by a mean width of the strokes in the 
text line region; 

marking the stroke as a false stroke if a number of 
strokes in a region that contains the stroke is less than 
a predefined threshold; and 
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reconnecting strokes except for a false stroke in 
the text line region if the false stroke is detected in 
the text line region. 

(addition 30) * A program for a computer that extracts 
at least one text line region from a given image, 
characterized in that the program directs the computer to 
perform a process comprising: 

generating an edge image of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge image; 

checking an overlap rate of a contour of a stroke in 
the binary image of the candidate character strokes by 
pixels indicating an edge in the edge image; 

determining that the stroke is a valid stroke if the 
overlap rate is greater than a predefined threshold and an 
invalid stroke if the overlap rate is less than the 
predefined threshold; 

removing the invalid stroke; and 

outputting information of remaining strokes in the 
binary image of the candidate character strokes, 
(addition 31) A computer-readable storage medium 

storing a program for a computer that selects a plurality 
of video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

removing redundant video frames from the given video 
frames; 

removing video frames that do not contain a text 
area from the given video frames; 
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detecting and removing redundant video frames caused 
by image shifting from the given video frames; and 

output ting remaining video frames as candidate text 
change frames. 

(addition 32) A computer-readable storage medium 

storing a program for a computer that selects a plurality 
of video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

determining whether two image blocks in the same 
position in two video frames of given video frames are a 
valid block pair that has an ability to show a change of 
image contents ; 

calculating a similarity of two image blocks of the 
valid block pair and determining whether the two image 
blocks are similar; 

determining whether the two video frames are similar 
by using a ratio of a number of similar image blocks to a 
total number of valid block pairs; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames, 
(addition 33) A computer-readable storage medium 

storing a program for a computer that selects a plurality 
of video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

generating a first binary image of a video frame of 
the given video frames; 

determining a position of a text line region by 
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using a horizontal projection and a vertical projection of 
the first binary image; 

generating a second binary image of every text line 
region; 

determining validity of a text line region by using 
a difference between the first binary image and the second 
binary image and a fill rate of a number of foreground 
pixels in the text line region to a total number of pixels 
in the text line region; 

confirming whether a set of continuous video frames 
are non-text frames that do not contain a text area by 
using a number of valid text line regions in the set of 
continuous video frames; and 

outputting remaining video frames after the non-text 
frames are removed/ as candidate text change frames, 
(addition 34) A computer-readable storage medium 

storing a program for a computer that selects a plurality 
of video frames including text contents from given video 
frames, characterized in that the program directs the 
computer to perform a process comprising: 

generating binary images of two video frames of the 
given video frames; 

determining a vertical position of every text line 
region by using horizontal projections of the binary 
images of- the two video frames; 

determining a vertical offset of image shifting 
between the two video frames and a similarity of the two 
video frames in a vertical direction by using correlation 
between the horizontal projections; 
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determining a horizontal offset of the image 
shifting and a similarity of the two video frames in a 
horizontal direction by using correlation between vertical 
projections of every text line in the binary images of the 
two video frames; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames, 
(addition 35) - - A computers-readable storage medium 
storing a program for a computer that extracts at least 
one text line region from a given image, characterized in 
that the program directs the computer to perform a process 
comprising : 

generating edge information of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge information; 

removing a false stroke from the binary image by 
using the edge information; 

combining a plurality of strokes into a text line 
region; 

removing a false character stroke from the text line 
region and reforming the text line region; 

binarizing the text line region by using a height of 
the text line region; and 

outputting a binary image of the text line region, 
(addition 36) A computer-readable storage medium 

storing a program for a computer that extracts at least 
one text line region from a given image, characterized in 
that the program directs the computer to perform a process 
comprising : 
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generating an edge image of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge image; 

checking an overlap rate of a contour of a stroke in 
the binary image of the candidate character strokes by 
pixels indicating an edge in the edge image; 

determining that the stroke is a valid stroke if the 
overlap rate is greater than a predefined threshold and an 
invalid stroke if the overlap rate is less than the 
predefined threshold; 

removing the invalid stroke; and 

outputting information of remaining strokes in the 
binary image of the candidate character strokes, 
(addition 37) A propagation signal for propagating a 

program to a computer that selects a plurality of video 
frames including text contents from given video frames, 
characterized in that the program directs the computer to 
perform a process comprising: 

removing redundant video frames from the given video 
frames; 

removing video frames that do not contain a text 
area from the given video frames; 

detecting and removing redundant video frames caused 
by image shifting from the given video frames; and 

outputting remaining video frames as candidate text 
change frames. 

(addition 38) A propagation signal for propagating a 

program to a computer that selects a plurality of video 
frames including text contents from given video frames, 
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characterized in that the program directs the computer to 
perforin a process comprising: 

determining whether two image blocks in the same 
position in two video frames of given video frames are a 
valid block pair that has an ability to show a change of 
image contents; 

calculating a similarity of two image blocks of the 
valid block pair and determining whether the two image 
blocks are similar; 

determining whether the two video frames are similar 
by using a ratio of a number of similar image blocks to a 
total number of valid block pairs; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames, 
(addition 39) A propagation signal for propagating a 

program to a computer that selects a plurality of video 
frames including text contents from given video frames, 
characterized in that the program directs the computer tc 
perform a process comprising: 

generating a first binary image of a video frame of 
the given video frames; 

determining a position of a text line region by 
using a horizontal projection and a vertical projection of 
the first binary image; 

generating a second binary image of every text line 
region; 

determining validity of a text line region by using 
a difference between the first binary image and the second 
binary image and a fill rate of a number of foreground 
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pixels in the text line region to a total number of pixels 
in the text line region; 

confirming whether a set of continuous video frames 
are non-text frames that do not contain a text area by 
using a number of valid text line regions in the set of 
continuous video frames; and 

outputting remaining video frames after the non-text 
frames are removed,^ as candidate text change frames, 
(addition 40) A propagation signal for propagating a 

program to a computer that selects a plurality of video 
frames including text contents from given video frames, 
characterized in that the program directs the computer to 
perform a process comprising: 

generating binary images of two video frames of the 
given video frames; 

determining a vertical position of every text line 
region by using horizontal projections of the binary 
images of the two video frames; 

determining a vertical offset of image shifting 
between the two video frames and a similarity of the two 
video frames in a vertical direction by using correlation 
between the horizontal projections; 

determining a horizontal offset of the image 
shifting and a similarity of the two video frames in a 
horizontal direction by using correlation between vertical 
projections of every text line in the binary images of the 
two video frames; and 

outputting remaining video frames after a similar 
video frame is removed, as candidate text change frames- 
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(addition 41) A propagation signal for propagating a 

program to a computer that extracts at least one text line 
region from a given image, characterized in that the 
program directs the computer to perform a process 
compris ing : 

generating edge information of the given image; 

generating a binary image of candidate character 
strokes in the given image _by^ using the edge information; 

removing a false stroke from the binary image by 
using the edge information; 

combining a plurality of strokes into a text line 
region; 

removing a false character stroke from the text line 
region and reforming the text line region; 

binarizing the text line region by using a height of 
the text line region; and 

outputting a binary image of the text line region, 
(addition 42) A propagation signal for propagating a 

program to a computer that extracts at least one text line- 
region from a given image, characterized in that the 
program directs the computer to perform a process 
comprising : 

generating an edge image of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge image; 

checking an overlap rate of a contour of a stroke in 
the binary image of the candidate character strokes by 
pixels indicating an edge in the edge image; 

determining that the stroke is a valid stroke if the 
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overlap rate is greater than a predefined threshold and an 
invalid stroke if the overlap rate is less than the 
predefined threshold; 

removing the invalid stroke; and 

outputting information of remaining strokes in the 
binary image of the candidate character strokes, 
(addition 43) A text change frame detection method for 

selecting a plurality^ of video frames that includes text 
contents from given video frames, said method comprising: 

removing redundant video frames from the given video 
frames; 

removing video frames that do not contain a text 
area from the given video frames; 

detecting and removing redundant video frames caused 
by image shifting from the given video frames; and 

presenting remaining video frames as candidate text 
change frames. 

(addition 44) A text extraction method for extracting 

at least one text line region from a given image, said 
method comprising: 

generating edge information of the given image; 

generating a binary image of candidate character 
strokes in the given image by using the edge information; 

removing a false stroke from the binary image by 
using the edge information; 

combining a plurality of strokes into a text line 
region; 

removing a false character stroke from the text line 
region and reforming the text line region; 
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binarizing the text line region by using a height of 
the text line region; and 

presenting a binary image of the text line region. 

Effect of Invention 

According to the present invention, duplicate video 
frames, shifting video frames as well as video frames that 
dp not_ contain a text area can be removed in a very fast 
speed from given video frames. Further, the text line 
region in a video frame can be accurately binarized since 
the false strokes are detected and removed as much as 
possible . 

Fig. 1 

the drawing which shows the configuration of the video 
text processing apparatus according to the present 
invention 
Fig. 2 

the processing flowchart of the video text processing 

apparatus 

Fig. 3 

the block diagram which shows the configuration of the 
text change frame detection apparatus according to the 
present invention 
Fig. 4 

the drawing which shows the configuration of the frame 
similarity measurement unit 
Fig . 5 
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the drawing which shows the configuration of the text 
frame detection and verification unit 
Fig . 6 

the drawing which shows the configuration of the image 
shifting detection unit 
Fig . 7 

the drawing which shows the first frame that has a text 

cpntent _ __. 

Fig. 8 

the drawing which shows the second frame that has a text 

content 

Fig. 9 

the drawing which shows the processing result of the frame 
similarity measurement unit 
Fig. 10 

the flowchart of the operation of the frame similarity 
measurement unit 
Fig. II 

the flowchart of determination of the similarity of two 

frames 

Fig. 12 

the flowchart of the operation of the image block 
validation unit 
Fig. 13 

the flowchart of the operation of the image block 
similarity measurement unit 
Fig. 14 

the drawing which shows the original video frame for text 
frame detection and verification 
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Fig. 15 

the drawing which shows the first binary image resulted 
from fast and simple image binarization 
Fig. 16 

the drawing which shows the result of horizontal 

projection 

Fig. 17 

the drawing which _ shows^ the result of projection 

regularization 

Fig. 18 

the drawing which shows the result of vertical binary 
projection in every candidate text line 
Fig. 19 

the drawing which shows the result of text line region 

determination 

Fig. 20 

the drawing which shows two pairs of binary images for two 
candidate text line regions 
Fig. 21 

the drawing which shows detected text line regions 
Fig. 22 

the flowchart of the operation of the text frame detection 
and verification unit (No. 1) 
Fig. 23 

the flowchart of the operation of the text frame detection 
and verification unit (No. 2) 
Fig. 24 

the flowchart of the operation of the fast and simple 
image binarization unit 
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Fig. 25 

the flowchart of Niblack's image binarization method 
Fig- 26 

the flowchart of the operation of the text line region 
determination unit 
Fig. 27 

the flowchart of horizontal image projection 
Fig. 28 

the flowchart of projection smoothing 
Fig. 29 

the flowchart of projection regular ization 
Fig. 30 

the drawing which shows examples of the max and min in a 

projection 

Fig, 31 

the flowchart of the operation of the text line 
confirmation unit 
Fig. 32 

the flowchart of the operation of the image shifting 
detection unit (No. 1) 
Fig. 33 

the flowchart of the operation of the image shifting 
detection unit (No. 2) 
Fig. 34 

the drawing which shows the configuration of the text- 
extraction apparatus according to the present invention 
Fig. 35 

the drawing which shows the configuration of the edge 
image generation unit 
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Fig. 36 

the drawing which shows the configuration of the stroke 
image generation unit 
Fig. 37 

the drawing which shows the configuration of the stroke 
filtering unit 
Fig. 38 

the- drawing which .shows. the configuration of the text line 
region formation unit 
Fig. 39 

the drawing which shows the configuration of the text line 
verification unit 
Fig. 40 

the drawing which shows the configuration of the text line 
binarization unit 
Fig. 41 

the drawing which shows the original video frame for text 

extraction 

Fig. 42 

the drawing which shows the result of edge image 

generation 

Fig. 43 

the drawing which shows the result of stroke generation 
Fig. 44 

the drawing which shows the result of stroke filtering 
Fig. 45 

the drawing which shows the result of text line region 

formation 

Fig. 4 6 
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the drawing which shows the final binarized text line 

regions 

Fig. 47 

the flowchart of the operation of the edge image 
generation unit (No. 1) 
Fig. 48 

the flowchart of the operation of the edge image 
• - - generation unit (Wo. 2) ' 
Fig. 49 

the drawing which shows the arrangement of the 
neighborhood of pixel i 
Fig. 50 

the flowchart of the operation of the edge strength 
calculation unit 
Fig. 51 

the flowchart of the operation of the stroke image 
generation unit 
Fig. 52 

the flowchart of the operation of the stroke filtering 

unit 

Fig. 53 

the flowchart of the operation of the stroke edge coverage 
validation unit 
Fig. 54 

the flowchart of the operation of the text line region 
formation unit 
Fig. 55 

the flowchart of the operation of the stroke connection 
checking unit 



tBSE#2 003-3090803 



2002-378577 



^-v: 77/ 



Fig. 56 

the flowchart cf the operation of the text line 
verif i cat ion unit 
Fig. 57 

the flowchart of the operation of the vertical false 
stroke detection unit 
Fig, 58 

=.... = ... the flowchart of multiple text line detection 

Fig. 59 

the flowchart of the operation of the horizontal false 
stroke detection unit 
Fig. 60 

the drawing which shows the first false stroke 
Fig. 61 

the drawing which shows the second false stroke 
Fig. 62 

the flowchart of the operation of the text line 
binarization unit 
Fig. 63 

the drawing which shows the configuration of an 
information processing apparatus 
Fig. 64 

the drawing which shows storage media 

Description of Notations 

101 video data 

102 TV video camera 

103 video decomposition unit 

104 text change frame detection apparatus 
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105 text extraction apparatus 

106, 1803 database 

301 frame similarity measurement unit 

302 text frame detection and verification unit 

303 image shifting detection unit 

311 image block validation unit 

312 image block similarity measurement unit 

313 - f rame similarity judgment unit 

321, 331 fast and simple image binarization unit 

322 text line region determination unit 

323 rebinarization unit 

324 text line confirmation unit 

325 text frame verification unit 

332 text line vertical position determination unit 

333 vertical shifting detection unit 

334 horizontal shifting detection unit 

901 edge image generation unit 

902 stroke image generation unit 

903 stroke filtering unit 

904 text line region formation unit 

905 text line verification unit 

906 text line binarization unit 

911 edge strength calculation unit 

912 first edge image generation unit 

913 second edge image generation unit 
921 local image binarization unit 

931 stroke edge coverage validation unit 

932 long straight line detection unit 
941 stroke connection checking unit 
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951 vertical false stroke detection unit 

952 horizontal false stroke detection unit 

953 text line reformation unit 

961 automatic size calculation unit 

962 block image binarization unit 
1541, 1542 false stroke 

1701 CPU 

' - - - ■ 1702 - memory - - , . . _ _ 

1703 input device 

1704 output device 

1705 external storage device 

1706 medium drive device 

1707 network connection device 

1708 video input device 

1709 bus 

1710 portable storage medium 

1801 server 

1802 information processing apparatus 
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the drawing which shows the configuration 
of the video text processing apparatus 
according to the present invention 
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[02] 



the processing flowchart 
of the video text processing apparatus 
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decompose video into continuous video frames 



measure the similarity of two nearby frames 
remove the second frame if two frames are similar 



detect text areas in the frames 
remove frames that do not contain text area 



detect image shifting between two frames 
remove the second frame if image shifting is detected 



/* generate edge image for candidate text change frames 



generate stroke image based on edge information 



stroke filtering 



connect strokes to form text lines 



text line filtering 
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text line image rebinarization 
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[HI3] 



the block diagram which shows the configuration 
of the text change frame detection apparatus 
according to the present invention 



input video frames 
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frame similarity measurement unit 



text frame detection and verification unit 



image shifting detection unit 
candidate text change frames 
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[SI4] 



the drawing which shows the configuration 
of the frame similarity measurement unit 
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301 



image block valiation unit 



image block similarity measurement unit 



311 



312 



frame similarity judgment unit 



313 



mtE#2 003-3090803 



mm 2002-378577 



^-v: 5/ 



[15] 



the drawing which shows the configuration 
of the text frame detection and verification unit 
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fast and simple image binarization unit 



text line region determination unit 



rebirarization unit 



text line confirmation unit 



text frame verification unit 
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[126] 



the drawing which shows the configuration 
of the image shifting detection unit 
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fast and simple image binarization unit 



331 



text line vertical position determination unit 



332 



vertical shifting detection unit 



333 



horizontal shifting detection unit 
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[0 10] 



the flowchart of the operation 
of the frame similarity measurement unit 
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[mil] 



the flowchart of determination 
of the similarity of two frames 
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into nBlock nonoverlapped 
image blocks with size N x N 
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the flowchart of the operation 
of the image block validation unit 
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Calculate the mean and variance of 
the block pair : M(i), M(j), V(i), V(j) 
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the flowchart of the operation 
of the image block similarity measurement unit 
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Calculate the mean of the 
block pair : M(i), M(j) 
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Calculate the correlation of the 
block pair C(i, j) 




S533 



similar image block pair 



S534 



dissimilar image block pair 



tiilE# 2 003-3090803 



&m 2002-378577 



^-v: 14/ 



[114] 



the drawing which shows the original video frame 
for text frame detection and verification 
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the drawing which shows the first binary image resulted 
from fast and simple image binarization 
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the drawing which shows 
the result of horizontal projection 
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the drawing which shows 
the result of projection regularization 
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the drawing which shows 
the result of vertical binary projection 
in every candidate text line 
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the drawing which shows the result 
of text line region determination 
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the drawing which shows two pairs of binary images 
for two candidate text line regions 
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the drawing which shows detected text line regions 
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the flowchart of the operation 
of the text frame detection 
and verification unit (No. 1 ) 
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the flowchart of the operation 
of the text frame detection 
and verification unit (No. 2) 




remove these false candidate 
text frames 
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the flowchart of the operation 
of the fast and simple image binarization unit 




divide the video frames into N x N 
nonoverlapped image blocks. Store the 
number of image blocks as nBlock 
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Niblack image binarization 
for the i th block 
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the flowchart of Niblack's image 
binarization method 
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the flowchart of the operation 
of the text line region determination unit 
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projection smoothing for Prjv(i) 



projection regularization 
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region height. Record the start 
and end point as sx(i) and ex(i) 



output the sx(i), ex(i), sy(i), ey(i) 
as the region of the text line 
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the flowchart of horizontal image projection 
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find all local maxima 
in the projection - 



set the value of regularized 
projection Prjr to 0 
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set the values between 
min l(i) and min2(i) of Prjr 
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the drawing which shows examples 
of the max and min in a projection 
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the flowchart of the operation 
of the text line confirmation unit 
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the flowchart of the operation 
of the image shifting detection unit (No. 1 ) 



image binarization for frame i and frame j 
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record the position of the maximum 
value in Cv(t) as offv 



projection regularization for Prjyi results 
in regularized projection Prjyir 
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find the candidate text line regions 
in frame i using Prjyir, record the 
vertical position of the text line in syi 
and eyi, save the number of text 
line regions as nCanTL 
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the flowchart of the operation 
of the image shifting detection unit (No. 2) 



Determine the c th corresponding shifting text 
_ . line region in frame j by- 
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calculate the vertical projections of two 
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the drawing which shows the configuration 
of the text extraction apparatus 
according to the present invention 
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the drawing which shows 
the configuration of the edge 
image generation unit 
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edge strength calculation unit 



first edge image generation unit 



second edge image generation unit 
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the drawing which shows 
the configuration of the stroke 
image generation unit 
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the drawing which shows 
the configuration of the stroke filtering unit 
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stroke edge coverage validation unit 



long straight line detection unit 
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the drawing which shows 
the configuration of the text 
line region formation unit 
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the drawing which shows 
the configuration of the text 
line verification unit 
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vertical false stroke detection unit 
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horizontal false stroke detection unit 
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text line reformation unit 
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the drawing which shows 
the configuration of the text 
line binarization unit 
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automatic size calculation unit 



block image binarization unit 
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the drawing which shows 
the original video frame for text extraction 
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the drawing which shows 
the result of edge image generation 
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the drawing which shows 
the result of stroke generation 
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the drawing which shows the result of stroke filtering 
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the drawing which shows 
the result of text line region formation 
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the drawing which shows 
the final binanzed text line regions 
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the flowchart of the operation 
of the edge image generation unit (No. 1) 
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the flowchart of the operation 
of the edge image generation unit (No. 2) 
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the drawing 
which shows the arrangement 
of the neighborhood of pixel i 
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the flowchart of the operation 
of the edge strength calculation unit 
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the flowchart of the operation 
of the stroke image generation unit 




stroke image pixel value 
calculation based on Nihlack's 
image binarization method 
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the flowchart of the operation 
of the stroke filtering unit 
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the flowchart of the operation 
of the stroke edge coverage validation unit 
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the flowchart of the operation 
of the text line region formation unit 
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the flowchart of the operation 
of the stroke connection checking unit 
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calculate the number of 
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intersection with both 
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the flowchart of the operation 
of the text line verification unit 
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the flowchart of the operation 
of the vertical false stroke detection unit 
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the flowchart 
of multiple text line detection 
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the flowchart of the operation 
of the horizontal false stroke detection unit 



calculate the mean width of the 
strokes in the text line 
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the drawing which 
shows the first false stroke 
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Text line 1 



Text line 2 
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the drawing which 
shows the second false stroke 
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the flowchart of the operation 
of the text line binarization unit 




determine the binarization window » _ 
size, wh, based on the height of the 




bin arize the text line region using 
edge based stroke image generation 
method with window size equal to 
wh 
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the drawing which shows 
the configuration of an information 
processing apparatus 
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the drawing which shows storage media 
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Problem -to be solved 

To select the candidate text change frames from a 
plurality of video frames in a fast speed and accurately 
detect the text region in the text change frame. 

Solution . . - . 

Video frames that contain text areas are selected 
from given video frames by removing redundant frames and 
non-text frames, the text areas in the selected frames are 
located by removing false strokes, and text lines in the 
text areas are extracted and binarized. 
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llOf^X hf i>y'7l/-AtH!Sfl±, il<7)7!/-Ai*^ 1 £2 
4LtHrt7 V-Uirh-T*7, |>n>f > ^ & t'ft "7 — A 

[0 0 16] 

#2<3-r** h^i>v7^-Atfcmg««4, HHI^n y * m?htt&WL, lift 
x. hfitz\£ 7*^-7 V-Ufrh7-*7, Y ^ > r- > V Z'a&m^kcO^Tj- 7 U - A & 
[0 0 17] 

7Ix-A|*|<7)|B]£{£ttK&& 2 ooHftT'n y ^7&\ Hit ^ >x >y Ogg-ft*^ 

tarn y ?s<T<Dmmztt-fz>Mwmi%.7°v ? ?mk<n\t*m^x 2 o 

U-A*&£Ltz'&l<zmZiXtz\sr*7U-AZMffi-T*X Ffx>y7 P-A 
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[0 0 18] 

i30f+xhfx>y7u-Attijifiii, »aittmm«=ntft#s, x+ 

^F7^ >fM&7E^I£> ft-mt^&i. f*X^7^ T-^r^ h 7 

> IC $ 2 <D - Ht® ft Sr £ & o x * X h 9 >f > mZ^WL l± > & 1 O-ffi 

®ft^H2c7)z:'|i®fto||t, h 9>f >fIit[*i<7)®^oim^^-r^^o 

[0 0 19] 

f*7P-A*in?t4o S&W&Hflritefl^&wu *fz.btifz¥r*yu~- 

±U2'D<Dh:'r*yi'-A<Dffl{ztelf2>mi$.i'y Y<r>m*7"ty h Ziih<D 
2^<n\£T*y\/-2*<7)mii\ft<r>mQ±&b*8kfc-fho «y7htffi^gi±, ± 

%^2^<D\£T*y]/-A<DZLmm\%p\\~iotfh7-*7. y^mm^owM 

£ffiv*t\ m\fyy K0tt7-b y h fnt)02ooifft7l/-A^ 
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[0020] 

[0 0 2 1 ] 
[0 0 2 2] 

^ y vm\^m^mt, ^-^hntzmm.^^ ? vit &<> x h n - ^ 
y $m<Dizmmm £ -r & 0 

[0 0 2 3] 

x y viiftfiox .j, v^^-r®^ t±tef^is^x h n - ^ ^zm® 

*§tt*U;r-e-OX hn-^^W?>tjx hn-^t^L, m«^ri*#r5eO L # Wit 
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«fc *)'^Zlitil£-£<DX hn-^|^ hn-^t^LT, hn-^£ 
[0 0 2 4] 
[0 0 2 5] 

- ...... IBm^MM^MWt} 

> St^^f 7*-*T—9 1 0 1 ifclifl/H' (TV) If 7^77^ 7l 0 2 

3 U i lx-AH5^ft?$tL^>o fit, T-^r^ h^-^>v-7 1x-A^ 

otfbtt^o t-^t^ h*-* yyy is-^femmwit. mMmmmzMs^mmir 

Z>ZtlZ%Z> 0 ^(D\k, "T*7s hfflttl^gl 0 5 1:, iif^r^ h7-x->i?-7U 
h^-f ><?)!ift££ "b^^OCR^SOTtz^tCr-'-^^-x 1 0 6 Cffi^ 

o 

[0 0 2 6] 

i2li, HI <D\£ T*t*7* hjllii^i7D-ft- h ^^Ltv^o 
S 2 0 1 0&S(ieft7>i^ 1 0 3 =t lOftfrti, S 2 0 2^f>S 2 0 4 iT 

S 2 1 0 JWMlir^r^ hffim^g 10 51a *)ftt>tLZ 0 
[0 0 2 7] 

i1\ A^^^t/clfT-^^BMLfc^P-At;^^^^ (S20 1) o *L 
-7 1- - ASittJtftpj^qTt>*v, 2 ooB¥ra±o 7 1/-A OtfiUBLJK^aJ $ ti 
h (S 2 0 2) o ftlf><0 2O(7)7l/-A^||f tt^lf, 2#i<7)-7Ix-A 
m&^Ztlho ^fctC, T-^r^ h "7 P-Al^tB^ i WfjE^T^btL, S 2 0 2^ 
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It^$^7l/-A^f*7 h7^f >£^fr7^£^7& ? ¥iJ^£n£ (S 2 0 3) 

^7^#:;e£*t£ (S 2 0 4) o i> L-e^ -eabtL(i\ 2#S07I/-A^Tt*$tL 
£o x** 1 0 4 <7)tfi7J(i. -i^fglif r-^rX h-f- 

^ y-j-7 v-uttz%>o 
[0 0 2 8] 

vSft^fife^n^ (S205) o fLT, X hD-^4«t)ii, -x. y ->'if 
ilHlfe^v*-C^ h n-^®ft^^^ti^> (S 2 0 6 ) 0 

)V9 y sftfftfofi, x 7 y'lf iHlo'v^H F n - ^«i^tL^, (S 

2 0 7) o HCf^X h7^f ymi&fcJ&tfftfrti, fl^^hn-^m? 
*IT 1 oOr-^rX h^-f >tc^^> (S 2 0 8) o r^r^ h 9>f ym£& 

(S209) o «?£ic, f+T.^^riW^^ 

1IiEf&?fr J £j£$*L& (S210) o <ll3-ffi2jf±. B*OtAKOCRx>^>t: 
[0 0 2 9] 

HI 3 li, 121 1 cox^^ Kx>y'7 l^-A^m^S 1 0 4 0i)S^ltv^ 

is-^mwmttm^3 o i tzmhti, -r** hyu-^mPMm3 o 2 zm^ 
- a wMmmm 301, r^h^-A ^tb^SEgp 302, &x una it v 7 

HfctiJSI$3 0 3 W\ Wnil, £2, *J,t0 f i3O7V'-A^t(:^ 
-T^o f + ^hfx>y'7P-Atlffiiil'0 4li, e^-xy^fft^Of 

bmi&*^&r^%^\ST*7U~Atti>lZ > mmLT\<*&}S^*-7U-A}3 

i^/7 h Lt^^m7U-A^, flsv^J 3-;^£|t#L&7^^#^jt5 
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[0 0 3 0] 

vimmm3 o 1 mmru > y ?^$MtU3 1 i , intra ? *«ijgftM 

3 12, i3J;^-7^-^^«*]^3 1 3*^ts 0 Wfefu y ?#£MfcS|$3 1 
1 (±, 2o<D\£7**yi'-Aft<Dmc&W.l~*>& 2 ocoMftyn y ^« 7 - n 

^ntt}<nhZ>m\%?u 7 ®f\ rn 7 ^MUKittJMSB 3 1 2 li, W 

Il7n -y ^^T<7) 2 ooijftrn y ? <7)MWJ£Znn U ^tt<b<7) 2 oofffiHtr 
n7^«Ltt^^M^tl,o 7lx-A«»J^3 1 3(2, ^ 

j&*j»JR Lt</^^§^^ ^^-T & o »JlJtftM 3 0 nan 

O 

[0 0 3 1 ] 

®5(2, gj3 cof-^* h -7 U-Afem&MU3 0 2 <7)1f^^^LTV^ 0 -f* 
Xf7P-AM»3 0 2(i, i§5i§^®ftn{Ifk^3 2 1 , T-^r^h^-T 
3 2 2 , ftfk^ 323, >fitl£gR 3 2 4, £ J; V 

r+^h7 V-A*£fiE^3 2 5 £^tf 0 i^iSWl^lJft— ffiftSlS 3 2 1 (i, \£r* 
VV-U(DW>\(n~mM\^^-tho -r^rX h 7^ ym®$k7g.U3 2 2 (2, fjl 

i <D-mmi%<vmMmtmmz*m^*xf-*x v u > fifths £&7E-f£o 

* h n ymmu3 2 4 «i, n i <7)~mmi%tm2 (DZLmmi^mt, -r^^v 
u >v¥Mft<owm<Dmmzn-t&^<Df-*x y y 4 ymm^y * r vy > k 

h 7 Ix-A^ilg|5 3 2 5(i, 1 a^ilMi" & tff 1 't7U -A [*1 ©tif + X h 7 



mSE# 2003-3090803 



Ofcm 2002-378577 



^-v: 14/ 



[0 0 3 2] 

126(2, H3 M£tB£R3 0 3 «*^LTi/^ 0 IHtv^h^tti 

§15 3 0 3 (2, it^fifMft-<fitfl:^3 3 1, 7-**h?4 >«lfig^g|5 3 3 2 
, Iv7 h ^tJHSB 3 3 3 , M&thg|$3 3 4 £#tr 0 Bjiffi^Hififer 

ffi^H53 3 1 (i, 2oco^T-^7lx-A^-jfi®ft^^fiS;-r^o f^r^h7^> 
3${£«$:7E§R3 3 2 (2, ^tih<7)~mmi£comM§Zi ffl^r t- * x >ffti 
^OJHSS^i^TEi-^o tt~>7 h&fcbSB3 3 3 (2, tli-f^i^ffl^t, ±ffi 

<D\zriryis-A<Dmiim<DmMmtZi}t7£irZ> 0 |->7 M&tiigK3 3 4(2, ± 
[0 0 3 3] 

H 7 £2 0^8 (2, |WJi:x^rXh3>T-> , 7 £^Tl-& 2 OC0"7 lx-A ^TKtTV^ 
&o 119(2, «m^>^2o<7)7lx-A(c^-r^7^-i,^«ftPJ^3 0 1 (Dm 

[0 0 3 4] 

dl 0(2, H4 0-7 1x-A^«ftiiJ§|5 3 0 1 coftljfj^n-^- f>-e&& 0 

g<7)-7 STEPfl07l/-Ai7)7 U ~ ^fflMUl-f & jf|W7l^- 

A tib^tl^ (S 5 0 2) o 2o<7)7V-A<7)Jt!i£(3&^T i f§<7)7P-A 
#S07lx-A<!:^LTV^tL(2 (S 5 0 3) , ift(0 7 I/- A{£B^ j # 
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|«7l/-Ai:yt>^U, 5km<Dtztb<DS 5 0 2 isiO'S 5 0 3^)ftPi 
[0 0 3 5] 

2 o<£>"7 lx- A7^&oTV>tU;£\ ^it^^ Ix- A<D 1 7 1/- Af£<7)7 U- A 
^Jt^ 5 f?llf§£*U f07 V-A«ki@<7)7 lx-A ( S 5 0 4 £ <fc 

S 5 0 5) o kfr j £ ^/h^i/^gtf^-f-i -y ? (S 5 0 6) 0 kf§<7) 

•7 1/- A#* j # g <7) 7 V- A (Dm -Ch *K if |<^7l/-A^kf g(071/-A 

tmULX^'Pr^jS 5 1 1 ) , k#g(7)-7 1/-A«^7 U-A 19 (S 

5 12), Jt«e<7)7ti6<7)S 502*3«t^S 50 3^I^S^I?tL-i>o 
[0 0 3 6] 

if|<7)7I/-A^kf@07l'-AfcMoTv^l^ k# s l /iltiiflnL ( 
S 5 0 5) . k** j «t y ^ $tL4 e k#* j J: < fctt 

*U£\ jf@07 1/-Ali-enM(?)7 1/-At(i||^-&Ct^ML, j#S 

<7)"7 ^-A{±ff7t^f^|ix^r^ Ffx>y71/-At LTV - ^ (S 5 0 

•7) o fit, jfB07I/-A^t,fffc4«f#J6i5 (S 5 0 8) o i!£<7> 
1&f?W-A(7M >Ty?X i £ S TE P <7>fDrt s A# If 7** 7 l^-A^i 
SnFr ame <fc QJz&lftilX (S 5 0 9) , ^(i^t> *) , fo^ofciHlf 

[0 0 3 7] 

7U-A MRS STEP O^EKi, 1&^tfrfi^#W[M « :tt*l»o 

STEP^^tlw tf^*<art#i±.ftKg?ffcu tt^^Ti-^-t^^ 

<7)7U-&mmt Ltl±, 1?!lxJ£\ S TE P = 4 7 1/-A (i'tl & 0 
[0 0 3 8] 

HI Hi, EI10OS503 CCisit* 2 7 1/ - A (D W<7)»lf^7 
n-^-v- h ^Ltv>-5o S 5 1 1 \z&tf 2>®M<T>7 u ^ - Hi, Hi 1 
K33V*T j £ k ClIM^- tX~'&hti2>o 
[0 0 3 9] 
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=t Wl7"n 7^^>FnSimi larli, 1"^T 0 UK5c$n* ( S 5 1 
3) o fit, if|W7U-At jfg^)7l/-A^ NXN^omilOM^lL 
fcvvhS&Mf&T/n y *n##J£*i, fixf>«lt7n^«nB lock 
i:LT|Eii$tL^ (S 5 1 4 ) 0 N= 1 6 XhZ> 0 2O071/-AR© 

i^m^-ru y mferv? tfcmztiz> 0 -e lx 

^7^tf707 ^^7t*^S*^fi -y * (S 5 1 5) o liT'n 

[0 0 4 0] 

7n f<T-f)m%]X$>thl$, >kC07*U y^^T^sc y 7^flh (S 5 1 9 
£ X 5 2 0) o 7n y ^^7^'ti7n y f<TX$>tU£, ^tffofu y ? ~h 
7>hnVa 1 i d#l tzlfmn L ( S 5 1 6 ) , Mftrn ? ^ »JlJgftSlJg|5 3 

1 2^fflv^r, 2 ^<nmi%?u y ? (Dmiimmimzfrz (ss 1 7) 0 fti^ 

oyn 7 ? WJlLtv^lf, M7D 7^^>FnSimi 1 a r j&* 1 rtf»t 
WtiU-t2> (S 5 1 8) o t^T<7)7o -7 ^TriiJtffcSflfcfc § (S5 1 9ioi 
^S 5 2 0) > 7l^-A^JS*iJ^g|5 3 1 3 £fflV>T. 2o©7P-A«L 
Tv>*^5^*^5g$*L4 (S 5 2 1) o <k<?>4ktirt*mt:ZtLtUi. 2^<D~7V 
-A&mULX^2>tfr%Zti2> (S 5 2 2) 0 

nSimi lar>nVal i'd*s imrate 

Mz.it, simrate = 0. 8 5Xhh 0 ±MM^mtz £ flfcJttUjf, 2o 
<D7l'-J*mmMkfr%Ztl2> (S 5 2 3) o 
[0 0 4 1 ] 

HI1 2 li. El 1 1 <DS 5 1 5 Kfclt&Bffc^n y ?&mtU3 1 1 0»^7D 
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-ft-h^U^o if, n#gc7)!Htyn -y <D^t^mmU 
ZttZ (S 5 2 4 ) 0 i #BC07 lx-A<7)!Hf7*n -y ^ co^i^flt^^is J; ^ 
ifcfi, -etL^M (i) (i) <hfS£*i& 0 }mB<D-7U-A<Dmi%7 

v> ?<nmMfe^ty&z.mffc\$. tti^tiu (j) jo^^v (j) <hie$ti 

£ Q -y ^T0 2o<7)55-ifcV (i) &£lfV (j) m^Pif£WL^> 
iTvJ:.l)/h$< (S 5 2 5) , 2o^M (i) fcJitfM (j) <0*& 

t £Br5e<£> L£WlTm=t O/J^ttn^ (S 5 2 6) , -e<0®ff7*n y ^ 
^riiIl7"D7^^ri:4^ (S 5 2 7) 0 n "CfclttUiT . ^r<DW®>y*u y 
MTmyo-^^Ti^l. (S 5 2 8) 0 
[0 0 4 2] 

(U 1 3 (±, EI 1 1 CO S 5 1 7 C£tt£lHt7*n y ^ii«fUW3 1 2 cotfr^ 
7D-f t-F^-^ltv^o n#gcoiift7 -> n -y ^^rco^pi^M (i) &£Zf 
M ( j ) t^S fit jf £ (S529) o 2 oco^M ( i ) i3 <£ ?/M ( j ) <D 
^tM^Bf^co L^v^Tm 1 J: *)±£tftil£ (S 5 3 0) , -ettib <7) 2 o<7)lj 

&7'u y ? mtmiLm&ru y ? t&%zti& (S534) 0 -e^-c&tt*u£\ 

2-0(Dm{%L7*U y 9<DmmC (i, j) ^ft#$tL^> (S5 3 1) ofilC (i 
, j ) t^p^cT) L^v^T c i f^itttUi" (S 5 3 2 ) , ^tih<D 2 ocoUft 
7*n -y ^(i^ilLTV^ <h<7*&£ti (S 5 3 3) , mfMC (i, j) ^L^fii 
T c J: *)/h£tt*U£\ -en^c0 2o<7)lifty"n y ^(i^mt^4^tL^ (S5 
3 4)o 

[0 0 4 3] 

Ell 4^^112 1 fVit, I5«f^r7 byU-A^.W,^MU3 0 «)fr 
[HI 6fi, irf»IS*^^Ltv^o Ell 7f±, Mf^Ef!iHto|£*£7KL 

[0 0 4 4] 
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H2 Oii, HI 9 ^£>v>TMltc <£ ()-7-^^tL/:2o^iit + x h 7^ > 
II^<7)~'(i®ft<7) 2 oo^TOlS*^^ Lti^o ft#JO^T fill ftfi-r* 

, ^Ot-^t^ h 7 4 >flitfi, K^-T^r^ h 7 A >WMt&t£^tl& 0 2#@<7) 
[0 0 4 5] , 

[H2 2 & «t C>*2 3 (i, 13Wf^x h 7 A^m^fiE^ 3 0 2 (DMfty u- 
Y^Jf.LX^ho if, lif *fli7 l/-A-t y 3 yO^ff^n 

, ^u-A^mjsftiiijsi5 3 o i vmyiztitzmftT** y yu-nt, — & 

OiSM-f&fgffi? W-A^-ett-rtL-g-tr^iSc^-lr^ y a >H:5Hg£*i& (S 7 0 

1 ) o CltL^co-b^ v 3 >o|gc(i, n S e c t ion tg£$n*o MVKD-k ? v 
a >j&»*b$&tf>T (S 7 0 2) % i#SO-t^^>3 >c0ilMi-^f«-7 1^-AOm 
M ( i ) ^^<7) L^>fT ncfi 0 lt*U* ( S 7 0 3 ) . i^igffi^iffift 
mift$3 2 1 SrfflV^T, -f <t^m7 l^-A<7)^.-'fii®ft^*i6^tL^> ( 
S 7 0 4) o ZHftH«0*i5J:yfffi*ff^*ffifflLfex*^ Y ? <i 
^3 2 2 ^:1^ h ><7JT$Mtf®:7zZtl& (S 7 0 5) 0 

[0 0 4 6] 

t(tS$^f^f?'f>^^t (S 7 0 6) , f?-<fIffcSB3 

2 3^t, f^rX h >^0 2#BOrii®fa^fi£$tL < 5 (S7 0 7 
) o nr.mitU3 2 3(i, Itiil^ti^-r^rX LTNiblac 

oo-fiMftti, t-*^ Y 7 4 >mUM3 2 4 izX *) Jt$c£ft& (S 7 0 8) o 
^ti£><7) 2 0(7)-filijft^mLTv^(i\ i flM^y a >Wf^X h 9^ 
>^nnTextLine ( i ) ri* 1 Itif^i" & ( S 7 0 9 ) 0 -<^JHS 
f±M (i) liOiS^f" Lti7 l^-Art<7)i-^T<7)x^x h 9>f ^CWLtiO 
(S 7 1 OiScttfS 7 1 1 ) o 
[0 0 4 7] 
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(S 7 1 2) , 
nTextLine (i) ^ a M (i) 

ztihote*) <7>ffl&7-*x b "7 v-a»±i^*$*l* (S713) o «{± 
, i 0&£$*i&iE<of£8rcab&o «#i±, « = o. 8^$w, - 

7 1 4 & Xl/S 7 1 5) o 
[0 0 4 8] 

@2 4lt E122<7)S704 IC^tt^^iS^aHft— -fiUfcgB 3 2 1 ©lft7D 
-ft-F^Ltv^ 0 7U-A®f^fi, S-f NXNOti<7)ilL^v« 
7o^^H$tl, Ztih<Dm\%.ru y ?<nmS-n B 1 o c k £ LTlSil^tL 
£ (S716) o N= 1 **0<OlBffe-/n y * frhfetbX (S 

7 17), Niblack Olift—'fit'fk^m: ») IKf/n y ?m^mtZtiZ> (S 7 
1 8) o Niblack (Dmi&^mttO^^*-? k(±, k=-0. 4 Hf£5e£*L& 0 
-co#M«ii-^TO®ff7'D -y ? tc^LTM^^I^tL^ (S 7 1 9i3J:LKS 7 2 

0) o 

[0 0 4 9] 

i2 5li, H2 4<?)S 7 1 8 Kisit&Niblack <7)mi%.~mt&<7) 7 u -v - 

a nii£ tfftWLV a r £ ti& ( S 7 2 1 ) 0 ^tlcV a r 7& s Bt5eO L § vHfi 

T v «fc *)/h£lt*U;T (S 7 2 2) , -eozHiMftrtWt-^TOIi^O lw^$ 
*i& 0 Va r>Tvtfeti(i\ <9 '*>f ^ 'J L # v^fi T7$ ? ftjf $ 0 
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T=M e a n + k * V a r 

®f£Ii3f i IfcK. ^ffiitO&fti&fifg r a y (i) X *) ± § tttUf (S7 
2 6), -ecozi-fitlilftb i n ( i ) ftcDmm&O Uf£^£*l£ (S 7 2 7) x 
d-e&tttUif, (S 7 2 8) o -co^ji|S{±-fI®ftl*l 

<D-$-'<x<Dmm^MLxm , o&zn2> (s 7 2 g&eto^s 730)0 

. [0 0 5 0] 

0 2 6(2, 0 2 2<7>S 7 0 5 Hiott£ f^fX h^^>II^^3 2 2 cot&fP 

& (S 7 3 1) o -<7)#t!£f±¥?#ft:£*iT (S 7 3 2) IEf!lHt£*i£ (S 7 3 3 
) o P r j h ^iEilJftL^^^tiP r j h r h & 0 , 0 tfcfi 1 W2o<o|W 

/h Iti^t £ £;ti*1-£o P r j h r ^&tt£^- lOfl^cO^irio 

fWtisy (i) isXlfe y (i) <h LTf2§l£ti£ (S73 4 
) o P r j h r 1 OfW^MLT, ^BftMf^P r j v ( i ) T^ItH 

$W (S 7 3 5) 0 Pr jv (i) (i¥<t^$tLT (S 7 3 6) IE|lHt£*i, 
Prjvr (i) hta%> (S737) 0 Prjvr (i) r*3^ 2 oco 1 <£>^i£M 

omsi75 ? 2 *fi^<7)is$ J: ^9/J^$^tn^i\ -eti<b<7) 2 i <Dmm*m$i£ fix 

1 oofUt t & «9 , J: tfifc-.Sri*, ^ft-rti s x ( i ) *5 «t 

Vex (i) £LTfEil£ti& (S 7 3 8) 0 Zfib<Dltit) s x (i) , ex ( 

i) , s y ( i ) , iS X If e y ( i ) K X I) , i #@ <£>f h 9 ^ >OfSit# ? 
5Efe£$*L& (S 7 3 9) o 
[0 0 5 1 ] 

02 7(2, 126WS73 1 Kfc»t*#liffeltf&<Z>7 n— f-*- h^^LT^ 
&o mn<Dmy^ yfrbfenbx (S 7 4 0) , i fioi7-f >izn-t2>tt&i> s 
>k^Z ±*)tm2ti2> (S74 1) o 
[0 0 5 2] 
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mi] 

[0 0 5 3] 

---e, i o, j) it i ft j wwmmx$> *) , w(i®ft<7)iiii-e*^o -<ott 

®ft<Oi§5££ h £ LX, Mf^O-t^TO^ > LT££ i9 £ 
(S742*5J:a f S 743) 0 
[0 0 5 4] 

EI2 8 ti, EI2 6 OS 7 3 2 iztotf Z>M&¥mt<D7 u ^ - h ^^Lt^' 

^»o ^fi-ft^ y v^(D^d Kmm-tz ^h^x (s 7 4 4) , ^-mm 

i ^@ (D&(Dm p r j s ( i ) tf^jUc J: *) tfWZtiZ ( S 7 4 5 ) 0 
[0 0 5 5] 

[&2] 

2d + 1 jtZs 

[0 0 5 6] 

M^LtLX, ^fk#tf^-r^TO*J^LT^f)^^tL>5, ( S 7 4 6 *3 «t 
S 7 4 7) o 

[0 0 5 7] 

112 9 14, 126WS 7 3 3 Ic^t^lti^EIlHtO 7 n ^ - h £tkLT^ 
£o S#JK, Stf^O-r^ro^Ut^ttl^ti^ (S 7 4 8) o lEWlitM^P 
r j r <7)-t"-<-r<7)lii^(7)fi(i 0 IzmTzZtlZ (S 7 4 9 ) 0 t^<7)IMm a x 
( i ) hi&t&X ( S 7 5 0 ) , I" CiS < (7) 2 0(7)^/Hitm i n 1 ( i ) & 4 V s 
m i n 2 ( i ) rt^m $ tL^> ( S 7 5 1 ) o 
[0 0 5 8] 

El 3 0 (4, Iti^ffi&llCfelt&m a x (i) , mini ( i ) „ &£Zfm i n 2 
(i) (D&WOffimMZTnLX^^&o 3 001^^1X1/^0 P2, P4 
, i3i^P6(±, Wtimax (1) , max (2) , iSi^max (3) t 
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i"*o PHimax (1) tC^i-^>±fIlj(7)^/Hjtm i n 1 (1) "Cab *9 . P 3 (2 
max (1) t;*t1-£TflJJ^ii/JvfIm i n 2 (1) -C&<i> 0 P 3 Urn a x (2) 
K*j"t"&±ffiO<7>@'Mtm i n 1 (2) -C&ab&o IrM*^, P 5 fim a x (2) \Z 
n~$-&T®\<7)m'HUm i n 2 (2) t?* 0 > max (3) H*H*& ±«Ot'M 
mini (3) P 7 lim a x (3) Kift&TffiiJ^&'JHfim i n 2 ( 

3) -Cab-So 

[0 0 5 9] 

mini ( i ) <m a x ( i ) / 2 , min2 (i)<max ( i ) / 

2 T-a^tKf (S 7 5 2) , mini ( i ) $3 £ Xfm i n 2 ( i ) okk'^MOiE 
HMfc«tfl*P r j rtfMItil KtfcfcStia (S 7 5 3 ) 0 -O^MSH^ «^ffi4^H 
It^iE^ti^ (S754i3J:D f S 755) 0 
[0 0 6 0] 

I3 1IJ, H22OS708 Hiott^T-^^ h 7^ >?tiS^3 2 4 (Dft^7 n 

>f >fp£<7)wx h«W 2 o or fit lift I 1 JsiO* I 2 "Cab&o * > 

^countK count2, fcWc o u n t^0 KW&Ztl& ( S 7 5 6 
) o c o u n t f±. I 1 £ J: Xf I 2 ft<7>*j-j£1-& 2 ooif(Df^f*> «b 1 "C 
«t 9 ^li^<7)m^S:i*i--5>o countlli, I 1 ^M^-fhm^m.^ 
I 2^W } 0-Cft*J:T!5:Hf^ti*tao c o u n t 2 li 

, 1 2 rto^s-t-^ffl^ofls^i -c& >) > i lrtow^ti-si^iito 

[0 0 6 1 ] 

2 o<7)®ftftOfll#J<0{£M7^^i6T, MEt^2o«I 1 (i) 
I 2 ( i ) t i> 1 -e*tL(S\ count^'l fittittJOi - ^ (S 7 5 

S 7 5 8 ) o fn^v^, II ( i ) 1 T-abtitf, count 1 1 fi 
ttiiip-t 4 ( S 7 5 9 £ X Xf S 7 6 0 ) 0 * ^*§£\ 12 ( i ) ^ 1 T* 

&*U£> count 2^1 /ittiiJ!«Ii-* ( S 7 6 1 £ J: £>* S 762) 0 f<t<?) 

a. y * Ztltzfe (S763*3«fcO f S764) , ^W^fWl/: £ 
S^^i y ? (S 7 6 5*3 XXfS 7 6 6) 0 
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count+count 1 < w * h / 2 , 
c oun t + c oun t 2 < w * h / 2 , 
count l<count*0. 2, 
count2<count*0. 2, 
f i 1 1 r a t e < 0 . 5 

-Tit* ^'f >yfMp '_f i 1 1 rate' li, -£<Dffiffi.fa<7)mmm%L^M1r 

, 2o<7)-fiiift{iC(7)7"^rX h 94 >®i£l*|K£v>T^lftLT^&i*fc$*i 
. ^<7)9*^ h 9 4 y^m*^1$}7-** Y 9 4 yt^tzZtih (S 7 6 8) o - 

h 94 >fc*$rS*L* (S 7 6 7) o 
[0 0 6 2] 

EI 3 2 ££(^3 3 fi, HI 6 O lift v "7 M£tB2G3 0 3«7n-ft- h £ 

#><btl£ (S8 0 1) o*:, ^ U-A i $>£.XfyV-A j HW1"^m#tf^P r 
j y i i3XX/P r j y j ^^fi^ffi^.tbZ>tztbK^ 9** h 9 4 > Mitt^SB 
3 3 2£fflv>T, 02 6 (OS 7 3 1 OJ: o **lififelt^*ff *>ti4 (S 8 0 2) 
o fit, SS->7 F&toS&3 3 3 £fflv>T, 2oo|ffW4ritCy (t) ^ 
ftJf£*i£ (S 8 0 3) o 
[0 0 6 31 

2o«P r j 1 (x) jS-fcO'P r j 2 (x) OfflHM&C (t) 
[0 0 6 4] 

m3] 

C(<) - Lm y imy2 L(*M*) - Ml)* (Prj2(jc + 0 - M2) 
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[0 0 6 5] 

<7)£^ liZfcmZti&o ZZX\ L(iMf^ft$"e* 19 , M 1 is li^tt^ 
tiM&P r j 1 is X Zf P r j 2 (D^ty-Ch <0 , V 1 £ J: <7V 2 (i^ti-e^Hi^P 

r j l iJitfP r j 2 <7>5Hft-e&£o 

[0 0 6 6] 

Cy (t) 0*7^11^9 0%J: M^t*Ui' (S 8 0 4) , 2oOlf|liy7 
hHftTii^v^ -f-^> -C&lttitf, Cy (t) Oft^:ficofifi^if^-7-b y b o 
f f ,y_fc L.-CSSil$tL .(S 8 0.5) . Mf^P r j y i <DjEmim& P r j y i r 
£*#>&7c*M;, S 7 3 30i d ^Mf^iEilJfbWt)ti^ (S 8 0 6) o 7 1/- 
A j tf7 V-A i <7)v-7 h Lfzi><DX$>tllf, 7 V-U j (T)^y7 \ ^ 7 -t y Y 
(io f f y tttho P r j y i r [*JO 1 <OW&l±-f ^Ttlf^rX h 7 
t<?*&£*i, $&*s y i fcitf&.fte y i KJ: «9^£ft& (S 8 0 7) 0 ilr 
*7sYU >W$L<omtnC a n T L <h LTfefl£n& 0 
[0 0 6 7] 

tHJ^ilr^^ >WMfrbl&tbX, t-nMatch^'OC 

WL^fih (S 8 0 8) o 7 1/-A jrt(OcSl^t£t^y7hiIf^7f 

7 y^mt. syj (c) = s y i (c) +0 f f yUi^e y j (c) = e 
yi (c) + 0 f f y(:J;t)i$*il.^ttl. (S 8 0 9) o 2oco^lSi-^> 
f^ffi-r^^ h 7^ >II^H^L-C. «f#i^ftlf (S810) o fit, 
fv7 M&ttig&3 3 4 £ffiv>-T, 2oco|g|tf^;iti-^tlWramC x (t) ^ff 
ff^ft, Cx (t) ^ft^^iS^ :ii^2o«i;jttti,it7t7 
h o f f x £ LTffiii^n^ (S 8 1 1 ) o C y ( t ) <7)flt7vHtf ? 9 0 % =t ^ 

sttntf (S812) , 2^<nmnT*xh7j >mimz~ : &Lfz^7 

V 7 4 y^Mt^^K^ -)>UMa t c hi)*l fzWHJUirZ (S 8 1 

3) e t<tOHir+X h 5>f S ^TiP* 3- v tZtifzfe (S 8 1 4£J:D f S 

8 15), — ifc Lf:v7ff + Xh7-f >1K8ttf>&J&*-r + >SMO|fc<7> 
7 0%i (S816) . 7U-A j H7V-A [ (D^7 V Ltzi><D 
t&%ZH& (S 8 1 7) o -ed-e&tftU^ 7I/-A j (±"71x-A i<Di/7 h 
Lfc7l/-Aflii^ (S 8 1 8) o 
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[0 0 6 8] 

0 3 4 l±, m 1 <7>x^X hjfttb^® 1 0 5 0M^Lt^-6, t-^X hffltti 
^■li, ^7** 7^-A<7)x y y'fftiMHitSxy v*Hffe£)S;ffl5 9 0 1, x. y 

£j£gr>9 0 2 , fel) <7)£^X Fn-^^H^t^X hn-77^;U^gri903, 

j£g|5 9 0 4 , f + X h 7 >fli-t[*](7)^l9 h n-7 £j|"IJP£i-&-f ^X h 

-~ ^ >ltfiH^9 0 5., &£&T*A h 7. < ffiWfft**** t"^X 

-<t<dt-^x h7^f >ff^:i{fiitft><7)'; x h-e&& 0 zv>f-*x h^m^fi i 

0 5^cttL(i\ -e£&/itt#< <7)^0<7)X h ^-77^a^£*LT|^£*£tl&<7)-C 
[0 0 6 9] 

EI 3 5 it, M3 4 O-i- y i?m\%L$L®M 9 0 1 ^M^^Ltt^o J- ? vfflft 
9 0 1 it, x- y v3«ft#^ 9 11, lioxy viffift£J» 9 1 2 , £> 
J; 2 7 vfflHt£j&g& 9 1 3 £#tr 0 y vSfiJSIMtSB 9 1 1 li, Sobel 

<7> j. y v Itft£J&£R 9 1 2 (±, mmMo -x. -y vS&Jt £df5e<£> x. y v L # i/Hi Jt 
$e L , -x. v 5£JSt&* L # v Mffi «t *) § if flit, 11^X7 v®it ft <7>*j-f&-t * IS 

>f f-'J^ti: LXtkM "0" £ffi^THiIjif£^LT& =U> 0 *20iyyl«4 
JM9 1 3(2. Hico^-y vMftrtOifE^^/^-T-ij^t^^r-t-^^-iij^ofim 

lift co^ y vS&jg^^x. y ^aftjs j; «9 /js £ it *tJE&1- & iB^oflt £ ±12^ 
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[0 0 7 0] 

EI 3 6(2, H3 4c7)j^ hn-?]B{t£j»9 0 2coM^Ltv^ 0 * hn 
-?®f££jCT9 0 2(2, ^Pfflft-'fi'fb^ 9 2 1 ^^f®f^Zl -fjfifb^ 9 

t-th+> 4 y K^^fflt^itCJ: 0, Niblack ^Zlflfb/jfe-C M 'T* ~7 V~ A 

i*i<7)«ljft^i:'fit'fbL-r^M3t^x h u~ ^ (DZL\%m\^^^> 0 mmmmr. 
fifb(c(2, Mtl£\ 1 l x l l (DW&k<V? 4 y K-y^ffl^ft&o 
[0 0 7 1] 

113 7(2, El 3 4<7)X h u-?y j )l?U9 0 3«^^Ltv^ c ^|>o 
J 0 3(2, X ytmmmms 3 l^ct^iS^ 

ttlg|59 3 2 4rffifx.* 0 X hn-^i 7 ytI|fM^9 3 1 li, i2<7)i7yl 

h n - ^ <h <7)m» £ ^ v ? L , M^«^(7) L § vHI\fc t) ± # it ft 

(2^cox hn-^» fn-^t^U EfcsfWtfJgtf) L §HB«fc 

tt ft (**-£>;* Fn-^^itix hn-^fcftjgfC, Fn-^^f|^x 

h n-* <h LtiitSo 9 3 2 (2, X h n - * oipi fc A $ 

[0 0 7 2] 

0 3 8(2, i3 4 0f + ^b7-f>iitM^9 0 4 0fi^tti^ 0 x 
*x h5>f>i«ffi»9 0 4 (2, X Fn-^ifx y ^9 4 1 £^ir 0 * 
^-^l^i y *SB9 4 1 (2, 2oolgf^^ h n coS £W filths £: 

^■BT^S^Sr^-i-y^-r^o "r*^ h 9 <Y >fI^B^ 9 0 4 (2, y ^ 
[0 0 7 3] 

113 9(2, i3 4©f ^ h7^f >»9 0 5«^Ltv^ 0 ir^x 
>t£f»9 0 5(2, 5 1, Fo-^t 

mg|5 9 5 2, ^i^f^r^ >&«g&9 5 3 &-£tr 0 ftftMbn-^ffl 
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g|$9 5 1 (4, 7-** >ffi.m*i<D* £ 0 £ <7>#x 

T 1 owtf^X >tI1$££j&LTV>*U4\ fn-^ <7>x h 

u-7 t LT-r-t-t&o mfe*)* h u-?tf[W,U9 5 2 (4, h <? 4 >m 

^ ^ -f - i y ^ L , fox h n-? ^^-friMftOX hn-7W^(7)L§v^ 
H4- iVh£(t*U4\ ^hn-^j|i)oxl^D-^Ht-7-ni)o -r^r 
9 5 3(4, f^fX h >Mrt-eft»IO^ hn-^^m$ 

[0 0 7 4] 

IH 4 0(4, HI3 4 Wf^rX >Hlfifl:gB9 0 6 WM^^LTV^ 0 r-^r 

X h =7 4 y~m\^ 9 0 6 (4, >f XWJtffi 9 6 l & «fc n 7 7 MffeZUfi 

lfcg&9 6 2 ££-tr 0 glft^M XfH$gB9 6 1 fi, ZffbfflO')^ > K^Ot^X 
&#Mc-*-&o 7*a y ^ MteZHI jfcgft 9 6 2 (4, H 2 »x y vHf&ft^lBafea^ 

, Nibiack (D-mit-^m^ \l^-*-7v-u \*)<nmmi%z~mit'rz>o 

[0 0 7 5] 

H4 1 ^<bEI4 6 f ^r^ hitBit<7)v^o^(7)|gf ^^Ltv^c 

H4 1(4, 1/-A^LTV^ 0 [3 4 2(4, f|!ir/II (£2 

(Dx-vVrnfe) £ tt&x- v v®ft^<7)^^^^LTv^ 0 0 4 3(4, *hn- 
?M«^Ltv^o 114 4(4, X I>o-^7^^'J VXOf^&^L 
tv^o d 4 5 (4, r^Xh7^ ^J&Ofgfc&jFLTV^&o H4 6 (4, 

[0 0 7 6] 

HI 4 7&4LK4 8 (4, II 3 5^)x.yyItM9 0 1 «7n-f^- h 
Sr^Uv^o WXHCO^OH 1 v®ftE d g e Img 1 <0®it 
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Edge Imgl ( i ) (Dlr^X <7>m.tf 0 £ ti£ (Si 101) 0 fit 

, «*0OH3&tf>ib*&ie>-C (S1102) , -x y v5Mftl§:S& 9 1 1 £ffiv>T, So 
bel i 7 ylttiSIt i ilO®f (7)x 7 y'SME (i) mmzti& (S 1 1 0 
3) o m 1 <DJ^ y ¥mi$L±J&M9 1 2 E d g e I mg 1 ( i ) 

Oft^fec^ti^o vSilJS^ScO L£ WjiT e dgei 0/v#tttL^ (S 
110 4) , Hi <7>.xy vfflftl*l<7)-<7)®^<7)iit(i 1 KSbeSfL, Edge I m 
gl (i)=lfcfc* (SI 105)o -<^^)llM(i-r^Tc7)jl^^-f-i y ? Zfi 

- - . — zi-em*) feznz (si 1 06fc«t^'s 1 1 07) o 

[0 0 7 7] 

11^7 i?Wi&tf&t>tLti&. WXHO»| 2 (Ox y vHftE d g e I 
m g 2 W^f^TOiE d g e I m g 2 ( i ) 7^0 U^JW-fb^ *i£ (SI 1 
0 8) o wrtJ}<vmmfrhM£LX (SI 109) , il^xyy'Bflrtolfw 
1-C^lf (SI 1 10) , i49 K^-f <fc 9 i coiSj^ 1 1 1 6 co@£ 

i^ftot, i£^M3IO^J.-y v^JS^i6<bn* (S 1 1 1 1) o -e-LT, 
& 2 y viBf&£j£gB 9 1 3 &ffiv>T, Ii©x y y ~sl&mt 

S 1 1 1 2) o -x-y v5#,Jt^ ? ¥^;^^ >>BlJ£«fc t)±&lttil£, »2 0xr/I« 
rtofflfm^l HSbeS-ft, ^9 -e&tttUi\ flttiO tc^$tL* 0 Iiwx 7 

.. vMfti*i<7)1"^r«oii^^i y y* ztitzfe (s 1 1 1 3 *jj:trs 1 1 1 4) , 

% 2 ox y vSift^flt^x. y v®ftE dge I mg i: LttUJ^^i^ (SI 1 1 
5) o 

[0 0 7 8] 

El 5 Oil, Hi 4 70S 1 10 3 \Z&W£>x-y vafijgfl-jf ffi 9 1 1 <Dmi^yu~ 

1116 |*JT\ ftt« «t tfiffi-x y v*&j£ Ex ( i ) iJ^'Ey ( i ) tf**5$;K «t *9 
(S 1 1 1 1 1 1 8) o 

Ex (i) =1 (d) +2*1 (e) +1 (f) - I (b) - 2*1 (a) 
- I (h) , 

tbffi#2 003-3090803 



#H 2002-378577 



^— v: 29/ 



Ey (i) =1 (b) +2*1 ( c ) +1 (d) - I (h) - 2*1 (g) 
-I (f) 

I (x) f±x#gco®* (x = a, b, c, d, e, f, g, h) <DWk 

mmzm-to vsjuke en it, >x^~x v^nztiz (s 1 1 1 9) 0 

[0 0 7 9] 

[t4] 

E(i) = jEx(i) * Ex(i) + Ey(i) '* Ey(i) 
[0 0 8 0] 

Ei4 8osi 1 1 1 liZ&tf&mm i <D¥-#jjiy ^5^jg{±. ^^«t«)tMf$n 

Medge (i) = (E (a) + E (b) +E (c) +E (d) +E (e) 

+ E (f ) +E (g) +E (h) +E ( i ) ) /9 

05 1li, h n - ? ®f&£$*B 9 0 2 cr>W)Y^y n -^ ^ - h£^L 

tv^o it, wxHwii^ h u-?mm*o^®wikz*iz> (SI 20 1 

) o fit, Ji#r®f&— -GUtSIS 9 2 1 £fflt/>t\ x h n-^®ft<7)®^<^flt^ 
5E$n* 0 MJ<9Iiit:^'b#n*!>T (S 1 2 0 2) „ xyyliflEd g e I m g <£> 
i #@<0®HcQfgE dgelmg ( i ) tfl -e&tUS (S 1 2 0 3) , mWz? V 

- ^wwftK^nmm^ftMZ^'i-b-t & iixi io^^ y Y^m%^t^, 

^Offi^&JeSft&o v®jftl^^1-^T<7)®^7&^i y * (S 1 2 

0 1 2 0 6) , X h n-^Mffc^JfcSft&o 

[0 0 8 1 ] 

El 5 2(2, 03 7(0^ Fd-^7^;^SI59 0 3«7n-ft- h^T^L 
*i& 0 IIOT^ h n-^^f,^A6T (S 1 3 0 1 ) > X h n - ? ?>i|g£ fclift $ 
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«OL#^fMAXSTROKES I ZE^itv^lf (S 1 3 0 2) , 
-<7)X h n-^(ij»£*i& (S 1 3 0 4) o It *Uf, XFd-^x.; 

yWB&&h1K&9 3 l&m^X^ ^m-^t»fx';^ti^ (S 
1 3 0 3) o ^7 FD-^iiiit^ Fn-Hti|L, f. n -?(i 
PJ?XM-K1^w„ ^hn-^«^tili\ -?-*UigiJ|&£ft£ ( 
S 1 3 0 4) o 7 Fn-^^t^ nSt roketU, 7 h D - * ®ftrt<7)i" 
-<T<7)X fn-^dtLtfx (S1305£ct^S130 
6) o . 
[0 0 8 2] 

HI 5 3 «\ H52COS1303 t,z&lf&7 Fn-^x 7 VW^M^VtU 9 3 1 

s 1 3 o 7) o m£^n&<D&m%mzionhE d g e i mgco®^t^\ * 

^<7)$^S*^^ (S 1 3 0 8) ^-^ y 7 (S 1 3 0 9) o IH4 9 Kt^LTz 

#15^.<7) A^>FnEdge^l /iltii^p-f^ (S1310) 0 nContour 

tt%*n&<D%it Lx-t^x (D%am&&i- ^ ? 7^^tz\k, mm*- ? y^^t 

tfO. 8*nContour=fc *)7C^t*Ui\ -?-CDX h u~7lt^j}7 Yu-7 
> 19 , ^fit^* h n-^ <h<^&£*i& (S 1 3 1 4) 0 ^-C&&ttUi\ 
-e<7)X h u-7{±m)]7 Yu-7Xh& (S1315) o nContour iZft 
«nEdge (DitmtMM^^LX^ho 
[0 0 8 3] 

EI5 4(i, [13 8 0t-^^ h7^f >ff*£ffM£l$9 0 4 <7)W)ft7 n - h £ 

Tf£;££*U r^^h7^ >c0^cn T L&n Stroke Uf£;£ £ (S 1 4 
0 1)o mJJ(D7 Yu-7frht&tbX (S1402) , * h n-* i (D>k <V7 Y 
n - 7 j T^ii^ tt (S1403) , ^hn-^ ifei^Fn-^ j^loo 

r^x h ^ >mmzm^^>^^^ ? 7 zti& (S1404) 0 ^x&tus 

, X Fo-7|Sfx y ^±|5 9 4 1 £ffiv>T, Z\fih<D 2 -d<D7 h d-^t)^^ 
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nTfg^S^^^ y >7 (S 1 4 0 5) o Wtmm-ehmt, 7,Yu->7 \ 

*l (S 1 4 0 6) , r-^r^ h 9 4 XDm&l tzlf ffi'JffZ (S 1 4 0 7) 0 
[0 0 8 4] 

7^f >j;S t> ..^ h D -^ j ^nf got h 7^f >CIL, Xhn-? i ^ 

£ tittle (S 1 4 0 8> S1409, S1410, iilFSHl 1) , nTL 
[0 0 8 5] 

IH 5 5f±, 0 5 4 <7)S 1 4 0 5 K&tt&X hn-^iifx7^||9 4 1 
ft7n-ft- h ^LO^o $-f\ 2 oox b u-?<DMZ h 1 & it^h 2 
£>*U Sv^Ofl5$75 s m a x h fcT-* ©VVfrOffi £;**m i n h t 

-?-?£ti%> (S 1 4 1 2) o iiii^fD-^ j O^'L^Wf ffg 

5 *m a x h <t «9^#ttnti\ £ P-h<D 2 o<7)X h n - ^ fi^^nTjtgT 
ii&v> (S 1 4 1 7) o -ed-Cfcttfiif, X h n-^ i iScfc^X hn-^ j <7>M 
Jjt&g.&W'D&y 4 y<D$fctf n Overlap^ LTffili^tL^ (S 1 4 1 4 
) o nOverlaprt s 0. 5 * m i n h =t 0 £ »t *UdT (S 1 4 1 5) , ^ 
e^2o^Xfn-^liitplitfe^ (S14 16) o *9-C&tt*UJr, £*i 
£><7)20(7)X h n-^fi^pT^-Cti^v^ (S 1 4 1 7) 0 S 1 4 1 5 tciolt<2> 
minh H*ti"-6 n 0 v e r 1 a p <oac&\fSti£&*3kt 0 
[0 0 8 6] 

05 6fi, 0 3 9 0f^^ h7>f >tfcaE8B9 0 5«7D-ft- YZtfL 
0>4 0 iT, M77^rao d f 1 a g 7& f ft£W!5S$ *L3 (SI 50 1) 0 ft 
UJOt^X h5>f>Sii**»t>i&ftt (S 1 5 0 2) , i #i (Or ^rX h 9 >fl 
^ffi $ H e i g h t ( i ) t& s #t5e<7> L # WtM INTLHE I GHTJ: 



£hfH#2 003-3090803 
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^-v: 32/ 



»t*u* (S1503) , zcdt** b ^4 yffimrtimztiz (s i 5 0 4) 0 
-e 0 -c& tt*u£\ » •) x h n - * &mffi 9 5 1 is «t uf«tft mfd-? ftajsg 

9 5 2 ^fflv^T, f% <9 co^ hn-*7&*tfcW$*i£ (S 1 5 0 SUi^S 1 5 0 6 
) oll^xfo-^«?W fwx h n-^f±glJBfc£*i (SI 507 

) , r-*xb7j>&mm>9 5 3 *m^*xmztitzxbu-?tfn&ffi.zti (s 

1 5 0 8) , ffcE7 7^j&*J*l::tS:5££*ia (S 1 5 0 9) o t-*7 
m5 9 5 3 t± % 7-*7 >&t&MJ&M9 0 4 tmffiK LXftZtitzX Yu-7 

^SMtSo t^TWf^X -y ^ (S 1 5 1 0*5 

ZVFS 1 5 1 1) , miEy ? y&WC$>ini$ (S 1 5 1 2) , fei)0^hD-V 

[0 0 8 7] 

115 7 f±, EI 5 6 OS 1 5 0 5 K*5tt&*Kft 0 X hn-?f£aiS&9 5 1 <DWlft 

ii^^ft#$n^ (S 1 5 1 3) o tlDO^ hD-^^f)J&ftt (S 1 5 1 4) 
, X i oas$**^*$ J: r>:fc#lt*Ltf (S1515) , v/i/^-r** 

h 7^ ^tB/W^tU h n-^ i <7)7£ffilJ<7)fiJi£[*]<7)X |>0-^?fi 

^ (S1516) o ^Fn-^i (Dl£m<Dmmt7- * 7 h v-T >w&p\u<om. 
£ Q Xl^o-^ i 2 oJ^±o^iMf^^ 

xfn-^i fiflgfe MFo-n^^, (si 5 2 0) o 

[0 0 8 8] 

*-?"Cfc»t*Ui\ v;vff^r^h7r/^lii^tL, XFn-^ i (7) 

£ffi<7>fl*£[*l<7)* h n-^^j. y ^ (SI 5 1 7) 0 ^Fn-^i o>fr 

®\<Dmmt, 7Yu-7 i 0&IBO«i«trat5ltcL-C5g|ft$n-5o i 
<Dfcm<7)W$Uft^ 2 oJSLho^MMt-r** h7^ 7 Y u 

-7 i ttflEfety X hn-^tiS (SI 5 2 0) o £ <03MI l±-r*X b 9>f >fl 
J&rt^-r-s-co* hn-^tfi ? * $n^i-eiiOii$n<?> (S 1 5 1 8 is J: 
CFS15 19) o 
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[0 0 8 9] 

H15 8 (i, H57<^S1516i3«tt>*S1517 KiSlt* v;i^-r** b 7^ 
>»7D-ft-K/TLtV^ 0 if\ f ; lrXh7'f>I^MS|5 9 0 4 
t|Hpf$KLTX hn-^^M^^ta 0 -r^rX F7^^»tnTe x t L i 
ne^U 0*§lf*U£ (S 1 5 2 2) , ^3owiOT/;^ti^»^ 

fx ? £*l&o 

1. 2o<0^il|f + 7 (SI 5 2 3) o 

. . _. _2„. ..rB? ^X h 7 > & 7 f^Xh7^> m&<D±K $>Z> ( S 

1 5 2 4) o 

3. #-r*x f7^ >ffl&n<r>X fn-^W3 <£ (S 1 5 2 5) 0 

h (S 1 5 2 6 ) o 
[0 0 9 0] 

@59it i56(OS 1 5 0 6 K j3tt£*te0 * h n- ^tfjgp 9 5 2 <DW)ft 
7D-ft-F^Ui/^ 0 $i\ x^x h ^ >fIitl*I<7)X h n - ^ co^iSj 
ipl^im^tt^ (S 1 5 2 7) o «#J<7)X h o-^^ib^T (S 1 5 2 8) , 
X hn-^ i mW¥#3* Yu-^7^(D2. 5f£J: It *U;T (S 1 5 2 9) 
, $L\&&MRi) t WLfcZ*lZ> (S 1 5 3 0) o R<?)£«R. 1 e f tjsittffr 
<7)%#R. rightli, *ti*rtix ha-^i S t r o k e ( i ) 

. L e f t is i^*#S troke (i). Right Hi *l& 0 
R^iWR. t o p*5<tCfT<7)it#R. bo t t omli, ^ti^ft^cOx* 
X h 9 >f >^Jt<^>±<7)^# text line, t o p £ J: CTFO^# t e x t 1 
ine. bottom iZX. O&ScSti&o ftitiiSftR l*)G9X h U - ? <D%il) s ntW 
2ti (S 1 5 3 1 ) , -e-0^ J 3^T-eabtL{f (S1532) , X h n-^ i (i 
iil^ Fo-^i: (S 1 5 3 3) 0 £03MKli-r h ^-T 

FD-^^fx y * £*L££Ti&»9iI£*l& (S 1 5 3 4 

jsit^S 1 5 3 5) o 
[0 0 9 1 ] 

H6 OH XX/ 6 lit, ^<7>X h n-^oW^Ltv^o 116 0<7)X fn- 

mSE# 2003-3090803 
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* 1 5 4 1 fiftfe *)7sYu-? X~3b l), 06 l^Xhn-^l 5 4 2 •) * 

[0 0 9 2] 

1216 2 fi, HI 4 0<7)-r^rX j» ^ >f >^{SltgE9 0 6 w^7n-f + - h £tTc 
LTV^ 0 if, §!&-9->rXft-JfgR9 6 1 £fflv»T, f ^rX h 7 ^ >IitOj|$ 
He i ghtCio'i, ^3oo^f|:iif:tJ:^ ^zMffio^ > K«?<7) 
t^fXwh^$ii* (S 1 6 0 1 ) o 

wh=He i ght/3, 
w h = w h + 1 if w h 
w h = 5 if w h < 5 



is an even number, 



<£<D&, 7*n y ? HJftZl ffifb«B 9 6 2 £fflv>T, fwt^^ h7^^S 
ZHiHfc£*l£ (S 1 6 0 2) o 7*n -y iiS-fb^9 6 2 fi, Niblack ^-ffi 

/ lt&<D<y 4 y X&wh Hi£5gU * h o - ? Mft^JaKSR 9 0 2 t|W|«i; 

[0 0 9 33 

1 0 4isXV ! 7-^x hm&mWl 0 5^li, 0Bx.ldT, 0 6 3 Kjjc*- «t 7 &1f 
fRM^S (=J>fcf.x-*) £fflv>Tf#J&£ : }x;s> 0 1216 3^)MMtili, C 
P U (i^&L&^ffi) 17 0 1. «'] 1 7 0 2, A/^M 1 7 0 3, £ti7J§|g 

1 7 0 4 , nSRIEIS^g 1 7 0 5, ^fEfijggg 1 7 0 6, ^;t>7 -^flg 
S 1 7 0 7, jo cfcO^T-'^ATJltS 1 7 0 8 *fflfx., Ztiblt'** 1 7 0 9 CJ: 

[0 0 9 4 J 

^•=6 'J 1 7 0 2 (i, 0flx.tf, ROM (read only memory) , RAM (random a 
ccess memory) MtCffl V>£> ft-S -7°o y 7 U js X V s 7* — 9 & 

o CPU 1 7 0 lfi, ^^>;i702 ZmmLX-fuy? At&ff-t&ZtlZX 
0, M^ll^T^o i3<7)30 H^3 0 3 £ "^<0#fflJt HI 3 4 
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O901*f>9 0 6 t-e<7)#^(i, ^Vl 7 0 2 (cf£|fl $ "7°n ^ A 
[0 0 9 5] 

A7J^ffi 1 7 0 3lt fOx.{i\ K, tf4 >t--< >^T-Vsvf^ N ? 

4(2, 0Ox.(i\ -ri-^7°n\ -7°<;>?, xe-^fti)!), x--if^<nffl^^ 

[0 0 9 6] 

wmznmwi 705a, tax.**, x?mw. *y§s«~ itm 

0 5(C ±teyn^^ A&ic^-^telflLT&l;, <J&^(C|£ £T, ttihi 
^<J 1 702 ten - KLtffflf^o ^IS«fi 1 7 0 5 (2, EI 1 <DWLft<D 

\£r*T~-9 i o i tfemi-zr-?^-* tLxi>m^hnz>o 

[0 0 9 7] 

^#iee!)^®i 7 o 6(2, ^mzmmwi i 1 ozmm l, -e^faii^^T 

^-t^i--l> 0 oT^HBiW#:i 7 1 0(2, ;*^'J#-K\ 7l/^y7>fVX?, 
CD-ROM (compact disk read only memory ) , itT-i^?, JtWk%-T 4 7, 

UUMWl 7 1 0(;_t|Syn^^A^J;^x-^^^LTib^, 'J^UfcDT 
, -eti^^^^'J 1 7 0 2CO- KUfltl.o 
[0 0 9 8] 

fry hi- tmWt^W: 1 7 0 7 (2, LAN (local area network) , 4 > 9 - 

fRMS^E(2, ilS^n^^AioiLKx-^ y h7-^gttl 1 7 0 7 £ 
ftLX'gtfmv) , -etL^^^^r'; 1 7 0 2 (in- KLT^fflf £ 

o 

[0 0 9 9] 

mAMtl 7 0 8(2, #!jx\(2, ilOTVt*ft*^7 1 0 2 Xh ^ , 
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EI 6 4 li, HI 6 SOlf^a^S^^n^^A^i^f-'-^^^i-^^iiO 

O^-tf— 8 0 1 1 8 0 3 [z^m^^tzy'uVy l*&£ZfT- 

?fi> ff^M^Sl 8 0 2O^t'J 1 7 0 2(:o- KS*i& 0 "9— /< 1 8 0 1 

Offii^Ii^S:^ LTlf f M^f 1 8 0 2 tc^lf 1"£ 0 CPU1701I± 
. . _ [0 10 0] 

i^rl" * m 2 <0 -7 W - A |5£*^|£ £ , 

Srfltx. £ £ <h tt-Sf^ffx>y'7l/ - AtfUtimWo 

[0101] 

(#12 2) «fSE*107l^-A|fe*^gli, 

mtttz.htitz\£Tir'7 v - u <r> n % <n 2 o©ifft7 I/- AftoiHj Cfigi; 
^TT&&^S^£^«Mftrn y tmWt^WLt, 

mffe^rn 7^^7<7)2 0(7)ifi/D y fomumzniMLx, 120^ 

ft r n y ? LTv^^«r 8w£1" & OS ft ^ n y * «U£tHM^& . 

t^7o y ^^TO|f:m^^i"^«®ft^C7 y ? <D&<7))£&m^XMU2 



ffiffiiRF 2003-3090803 
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mum i <r>-y itm7u-AOT^m7i/-A 

llto 

[0 10 2] 

mum i cozimmmvymmzt mm&zm^xr-** y u >fwofsa^^ 
mum i <vzLiMm\%tm2 <D~mm\%.(D^t^ r-^-^Yy^ ymM^<Dmmco 

[0 10 3] 

(ttffl 4 ) hu f fi^ 3 CD "7 U — A P£*3M& ti > 

v<Dm^y^y Yt, m2^<D\£rtyu~A(omyj^]omiimtak^'t^m 

*§HQ£ffl^T, HyfSlJftv^ h i2oom71/-A(7)| 
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-t h %M & t: r-*^- 7 1/ - A <h L T Bfc*-f * -r * # & -r £ # s a 1 1E«<0 t * x 
[0 10 4] 

mu^-fu 7^^7^2 0(7)ifi7'n y ?<7>MMmznnLx, m2^<Dm 

ft r n >y * j&n8*R L T v ^ * ftSfei - * ® ft 7 n y ^ «{MftiIJ¥l£ . 
ti7"n y ^^T<o|£|fc4c^&»l®ft-7*n y ? omcoittm^'XifiU 2 o 

[0 10 5] 

(#156) ^-x.^tL/^lfT-^ 7V-A^t,fn h 3>x>7 £^tHflk?> 

mt^x.(biifz^T^y u-&(Do t>(ni -o(D\£7*Jr~7 v-i*<dW> i -fit® 
ft^^i-^i^iSffi^Sft-ftfb^i:, 

huies i <n-mmi%<Dffimztmmzzm^x7-*x h7>f>««oei«:ft 

lMit-?.eft7l/-A^f^X hfH££^-£ &V^f-** h7l/-Af 
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[0 10 6] 

m tmm^nm % m ^ x , mm 2^<D\£r*7u-j,<Dm^&iiz>mi%><7 
b<Dm*7-ty b m2^<D\d~r*7u-^<Dmi]fa<vmvmtzfe^&m 

TOSrffl^T, fEMf|y7 b<Dffi*7-t: y b <h, 12^(7) tfT*^" 7 1/-A 
[0 10 7] 

(#ifi8) 5-z.<btitzWi%frb'j?% < ti>i -dcdt-** by 4 >mt&*wm 
huis-x- v ismmzm^xmtt^htifzmi%\H<DMnj:^x bu-? <n~m 
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- - . [0 10 8] 

(ffrSB 9 ) hu ffix y ^®ft^^S(i , 

sobei x y v^mti^ffl^rfriE#£ utimi^^mm^^ v vfemzm 
<D'*4 -jr v m^wfc-t z> t\z x. w>w> i <7)3-v ismi%*£.f&-tz>m 1 oxy 

S^¥*^x. y ygftjft J: 0 /Jn£ &t*U£\ fgittDi-^M^ojg^ iEio;^ i; 

icRjgt* - 1 iz x y) , mm 2 ox y » 2 ox -y vint^ 

[0 10 91 

(#SB 1 0 ) Jttrffi* Iift£/&¥J£fi> lftiB& 2 Ox -y ^HflfertOflT 

<t^ Nibiack <D~mtiim^m^^hfitzmi^imm\^^mitLxmu 
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[0 1 10] 

(ftSEl 1) HtrfE^ fn-^7 0V^tli, 

h n - ? con-fit !If£|*|<7) * hn-? co|&fP t <DM.mm ^fx7H> l£fi*t^ 
p^<0 L^vMiti ^£tttU;f|£x Fn-^^t^ Fo-^^L, f£M*g 
WSCpJtteCO L £ Wit J: /h £ it^UifflCX h n - ? h n - ^ t ^Lt 

- - - Htrta^. > u - ? cofi t $ $ £ ffl y > x ± p ^Fn-H B&*-r * *E»lfcttJ^ 

[0 111] 

(ftfE 1 2 ) gffS-r^rX fy^ 

[0 112] 

(tt§E13) HufE-r^^ h ^ ymmm^mt. 2ocoPt^1-^^ hn- 
[0 113] 

(ftlBl 4) HUfEx^r* f7^>t!E»> 
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HUpSt" >f!ft[*I<7)X hn-^oWfiUJ: «?&^$ft£ L§v^«t 

£ # tr d t i" £ ftlE 8 lefcco f + X f tf| tti^go 

.......... [0 1 14] 

(ffrKl 5) #x.£ftfcBf£rt>'b^&< ti> lowf + x 

HT x. f> ft S ft CO y v ® ft Sr & J- y v Bflfc£l£¥& t , 

Htfffi-x y v®ft^ffl^rmfE^^^ft7t:liftI*Icof^M^X b o- ? <D~mM 

HtTlfii y v®ftrtcoi y v*^i-B3HfcKrfB«l«5:^x h n - * cozHfiBftl*! 
cox h n-^ottJBfcoMiJfsfc^i 7 7 L, MfiW^THCoLI'V^iiJ: J) tc 

i ttftrax h n - ? hn-^t^i, mMmm^mm^ l # ^nt 

«t O'hSltfttiHK* fn-^^iix hn-^t^Lt, f£»x h £ 
[0 115] 

(ffrKi 6) ^•x.^ft7tvfr ? ^-7 p-A^f,f+x h ^yy-yyZ'&tsmk 
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mztifz^T-**? u-Azmmr-** v^^^vyv-At Lxmtii-^ 

[0 1 16] 

(ftSEl 7) itfffS-^-x. ^tLf-tf t-*^-"7 Ix — A CO 9 £>C0 2 o<^tf-f*^-7 — A 
- ^ 7 ' D 7 * LT>/^ ^^L, 

a - * K#tff $ -r t n& t -r aftiB i 6 ie«o y n 

[0 117] 

(ttffi 1 8) mflfi-^-X. ^tL7t Vf r-*^- 7 lx — A CO 7 1 0<5kf7't7 lx — A 
1 aOilM-T -5> tf t"'^~ "7 U — A * * h &V^-r** h^l^-At? 

lei 6ie«coyn^9A 0 

[0 118] 

(#fE19) f)IS-7 0 n^7A(i N 
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m um^comm £ m v > X , mi IS 2 oco if 7"^- 7 - a <d m: & it %> ifflft -> -7 
[0 1 19] 

(#|E2 0) -^-X.'bfLfctfT' *7 V-A^^-r^rX h 3 > X > U> £#&1t^ 

2 ocoijftyn y ®ft^ >t- ^^O^b^^i-fbTJ^^^^^r n >y ^ 

HulSW^rn 7M7(D2 ooMftrn y ^ oSMRJS Lt, K 2 oo® 
ft 7* n -y ? 7^mm Ltv^^S^^r L > 

£ fu f S n > fcf - * tc^r $ -£ & d t £ <h f & 7° n ^ 9 A 0 
[0 12 0] 

(ffrSE2 1) ^z.bntz\£r*-7V-Afrh7-** Y*>7->v*&tt%m. 
<D \£y*3r yV-U %miR1r2>a y\£a--9<Dtztb<F>y°uf7^ x*h o T, 

mtttZ-btltz\ZTj-y V-A(Do ^<D 1 O(0t"f*t7 lx-A<7)fj| 1 COZlffilJ 

Htjie^ i <D-mwm<7)mmztmm&*m^xy-** hy^ymm^am^rk 
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Afflux** h "7 U-ASr^*L^»C5S$tL^e^7V-A*|9l»^"^ 
[0121] 

(ftfB2 2) #;L^n^tff *7 lx-A^f-^x h n >y- > "7 £#tH®2& 
CO 7 V- A ^m^h 3 > tf a - 9 <Dtztb<D-7°U ^AT^ot, 

fflH*fflV>t, lWIBH«v7 hOft7-t 7 h ^2o<7)lf-r^-7W-A<7)«| 
» t r'i 7 V - A ^ Lf:t $ tit If ft 7 1/ - A If ^rT, 

[0 12 2] 

(#12 2 3) #x.<btL7t:®f^^^^< loor h 9>f 
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fj I £i y v If ffc £ ffl v > TMffi— fit®ft^ ^41)<0^ hn-^I^L, 
[0 12 3] 

(#IS2 4) sobei ^? i>i%&&zm^xmtt^hnfzmi%ft<nmm&<7) 
vHi<t it*u*\ si 7 vi<gi*i wtsitois-jb-syu t u § 

[0 12 4] 

(ffrSB2 5) BUIBH2 0x7 v®ftft<^pfS&&^ ^'Jtefc^rf <5>#®3!l 
OffiHSr^t-r&^-f > K*«:fl3v>&£iKJ: J), Niblack OZMIffc^-Cpr 

tBfiE#2 003-3090803 
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[0 12 5] 

HufEm^^Rf^^ L § X *) # ttftUTfufE^ h o - * b n - ^ 

iiuiE»^ h n - ^ «:|^*i-&fttaS:fflriB3 > fc? 9 KlSfT$-£* d h 
ft© 2 4|E«oyn/7A 0 
[0 12 61 

(#IE2 7) ~mtmw 4 y k^wx^^l, 

-T^pfE-? > K^Srfflv^i blZX I), Niblack orfliffc^&^ffirfE-^x. <b*i 
/c®ft^^®ft^-^fki-^^S^H>rfErr > tf ^- * KHfx £ £ 

ti"*ftSE2 4fE*c^7°n^^A 0 
[0 12 7] 

(#IE2 8) 2 o^iif^^ h n-^(0H$Wlt$i:i2oO7 l h n- 
[0 12 8] 

(#IE2 9) iftfEx** h 9>f >^Jt[*IO* h n - ? ospjSjg; £ «fc 
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buIBt-** h 9 >fnti*i-e^^ (7)x h n-?^ft$tLtiy:\ if+x h 9 

fcf ^ - ? $ & - t £ £ -T & #|B 2 3 f Sic<7) 7 'n^7A 0 

[0 12 9] 

dm 3 0 ) f>tL7t®ft7)^^^ < t 1> 1 O^f^rX h 7 J y^M^Nl 

&7EU Rm«« s RBr5eOL£vMiL.fc «}/h£»t*U*ifcx Fn-^|^ 
— 7 

WufB^M^^ h n-^(7)-fSiift[*I(7)^$n7t:^ h n - ? (7>|f #£tB7j1-;z> 
Ma^HufEn > tf a. - ^ fcHfrS-fr-S - t *m$kt't&-7u 7*7 A c 
[0 13 0] 

(D tf x * 7 1/ - A £ 31 K "t * 3 > kf - * O *i> co 7° o $r ^ a £ IB 11 L I B IIS 

16tU J B-x.bifitzVr'*y hffit&*&i fcufcT ^7 P-A £ 

IffS-^x. t>tLfetTr** 7 U-A^c, lift v 7 h tctH@-t^%ft^^T-^-7 lx- 
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[0131] 

(#IE3 2) ^•x.^ti^lfT'^^W-A^ibT-^rX h n > -f > 7 ^tflttfc 
O \£ t* * 7 V - A £ 51 ^ -f £ > e jl - * O *i> <D •? u V 7 A £ f S#i L i f Sllig 

-y ?^70 2oolt7'n y * <7)«JS£ftJ§: Lt , |2ow| 
ft ■/ n ? * ^»JL L T ^ & fr&fr & L , 

f^7'D -y ^^T^^.m^^i-^SmSftrn y ^<7)mojrt^fflv^Tfrfe2o 
<7> tf -f* -7 1/ - A L T v> 2> ir&in £ fejE L > 

«f fft 7 1/ - A ^ L /:f ^ tit t r*t 7l/-A^tIf^r^ h 

[0 13 2] 

(#133 3) ^^ti^tffi7 W-A^tj-r^r^ h 3 ^-r ^7 
mrfS-^-x.^^7 fc v:H-r^-7 1x-a<7)^£><7)1 ootfr^? 1/ — A«| 1 OZUfi® 

mfisii i <o-mwm<7)ffimztmmzzm^x7*x 

nine* i <D~mmi%tm2 <D-mmi&<omt. 7** v 7 4 ymi&ft<omm<v 
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ii^ huib^ > t? a. - * tc^T£-£& - 1 *mwt-tz> n > tr i - o wr 

[0 13 3] 

mwMfmmm *m v> x > mriE 2 00 if 7-** 7 u< - a (7) m u & tt & nit -> 7 

fflM^fflV^-C, HUf5®ft^>7 hO|f*7-try h fc, g2owifft7I/-A©i 

tea* Huis=f > fcf i - 7 iciifi 1 ^-^ & £ t fc -r & 3 > e * - ^ m&m 0 
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